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"ost  military  personnel  planning  models  are 
deterministic  steady-state  models.  This 
Note  examines  the  impact  of  various  types 
of  uncertainties  on  proiections  of  force 
structures  using  a  Markov  flow  model  of  the 
first-ters  force.  In  particular,  it 
addresses  the  impact  of  uncertainties 
related  to  the  supply  of  enlisted  Air  Force 
personnel  (stav/leave  decisions  by  or  about 
individual  airmen,  the  makeup  of  accession 
cohorts,  retention  rate  estimation,  and 
recruiting  shortfalls)  on  force  planning 
factors  such  as  accession  requirements, 
reenlistmeat  requirements,  and  personnel 
costs.  The  analysis  indicates  that 
proiections  of  many  force  characteristics 
can  involve  sizeable  uncertainties. 
Individual  stay/leave  decisions  comprise 
the  larqest  source  of  this  uncertainty. 
Another  potentially  larqer  contributor  is 
uncertainty  in  the  proportion  of  accession 
requirements  that  can  actually  be  met. 
Oncerta inties  teqardinq  estimates  of  flow 
rates,  while  important  in  proiecting  values 
for  certain  subsets  of  the  force,  appear  to 
conrtibute  little  to  uncertainty  in  overall 
force  characteristics. 
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PREFACE 


This  Note  was  prepared  as  part  of  Rand's  "Enlisted  Force  Manage¬ 
ment”  project.  The  project,  which  is  part  of  the  Project  AIR  FORCE 
Resource  Management  Program,  is  being  performed  for  the  Directorate  of 
Personnel  Plans,  Headquarters,  United  States  Air  Force.  The  purpose 
of  the  project  is  to  develop  the  specifications  for  an  enlisted  force 
management  planning  system  to  replace  the  Air  Force's  current  system 
known  as  TOPCAP  (Total  Objective  Plan  for  Career  Airman  Personnel). 

All  of  the  models  in  the  TOPCAP  system  are  deterministic — i.e., 
they  ignore  the  uncertainty  inherent  in  projecting  both  the  demand 
(requirements)  for  manpower  and  the  supply  of  personnel  that  will  be 
available  to  meet  the  demand.  Before  undertaking  serious  development 
ol  new  models,  the  authors  carried  out  an  investigation  of  the  degree 
of  uncertainty  implicit  in  personnel  flows.  The  investigation  was  to 
evaluate  the  need  for  incorporating  uncertainty  in  the  new  models,  and 
to  consider  alternative  ways  of  doing  so. 

This  Note  focuses  on  uncertainty  in  the  supply  of  personnel:  stay/ 
leave  decisions  of  airmen,  the  composition  of  accession  cohorts,  reten¬ 
tion  rates,  and  recruiting  shortfalls.  It  discusses  the  effect  of 
uncertainty  on  the  relationships  between  these  variables  and  such  work 
force  characteristics  as  accession  requirements,  reenlistment  require¬ 
ments,  and  costs.  The  analytical  tool  used  is  a  Markov  chain  model 
representing  flows  in  the  first-term  enlisted  work  force. 

This  Note  should  be  of  interest  to  personnel  planners  in  the  Air 
Force  and  the  other  armed  services,  as  well  as  to  analysts  developing 
models  for  use  in  analyzing  manpower  and  personnel  policies  in  botli 


the  public  and  the  private  sectors. 
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SUMMARY 

**> 

The  Air  Force's  TOPCAP  system  (Total  Objective  Plan  for  Career 
Airman  Personnel)  includes  a  nimber  of  models  that  describe  the  flow 
ot  people  through  the  enlisted  work  force.  These  models  are  deter¬ 
ministic  (e.g.,  they  ignore  the  uncertainty  implicit  in  personnel 
loss  projections)  and  most  of  them  are  steady-state  (i.e.,  they  ignore 
the  current  enlisted  personnel  "inventory"  and  its  evolutionary  possi¬ 
bilities).  In  this  Note  we  address  the  impact  on  work  force  struc¬ 
ture  uncertainty  of  various  factors  related  to  the  "supply”  of 
personnel:  stay/leave  decisions  by  or  about  individual  airmen,  the 

makeup  of  accession  cohorts,  retention  rate  estimation,  and  recruiting 
shortfalls.  Specifically,  we  analyze  the  impact  of  uncertainty  in 
these  random  quantities  on  work  force  characteristics  such  as  accession 
requirements,  reenlistment  requirements,  and  personnel  costs.  Further, 
these  relationships  are  examined  over  time.  We  also  discuss  methods 
for  improving  estimates  of  "flow  rates"  for  personnel  planning  models 
and  for  estimating  how  these  rates  will  change  under  altered  personnel 
policies.  The  intent  is  to  ascertain  appropriate  directions  for  ex¬ 
tending  Air  Force  personnel  planning  models,  with  particular  regard  to 
uncertainty,  rather  than  to  describe  the  actual  amounts  of  uncertainty 
that  exist  or  to  demonstrate  or  evaluate  different  methods  of  dealing 
with  uncertainty. 

We  evaluate  the  extent  of  uncertainty  in  projections  of  work  force 
structures  using  an  analytical  Markov  flow  model  that  focuses  on  the 
first-term  enlisted  work  force.  Our  analysis  indicates  that  projections 
for  many  work  force  characteristics  can  involve  sizeable  uncertainties. 
Two-standard  deviation  confidence  intervals  often  contain  values 
differing  10  to  AO  percent  from  corresponding  expected  (mean)  values. 
Individual  stay/leave  decisions  comprise  the  largest  source  of  this 
uncertainty.  Another  potentially  large  contributor  is  uncertainty 
in  the  proportion  of  accession  requirements  that  actually  can  be  met. 
Uncertainties  regarding  the  mix  of  people  that  can  be  accessed  and 
regarding  estimates  of  flow  rates,  while  important  in  projecting 
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values  for  certain  subsets  of  the  work  force,  appear  to  contribute 
less  to  uncertainty  in  overall  work  force  .  liar.icter  ist  ics. 

Since  uncertainties  in  project  ini’,  t  tie  values  of  these  character¬ 
istics  can  be  substantial,  there  may  also  be  substantial  uncertainty 
in  predicting  the  effects  of  pol  icymuki-.  -  '  decisions.  This  leads  to  the 
quest  ion  of  assessment  of  risk--th.it  is,  tho  probl#5ii  of  determining  how 
far  off  mean  value  calculations  are  likelv  to  be  or  of  determining  the 
likelihood  of  certain  undesirable  events  (e.g.,  unusually  large 
accession  quantities  or  required  reenlistment  rates).  Wo  conclude 
that  if  "protection"  from  undesirable  events  is  important,  it  tan  be 
obtained  by  adding  to  determinist  io  flow  models  constraints  determined 
using  stochastic  post-processors  that  could  compute  the  approximate 
probabilities  of  certain  events  and/or  of  actual  results  differing 
from  mean  value  estimates  by  specified  amounts. 

We  also  recommend  that  improved  procedures  be  developed  for 
estimating  probabilistic  parameters  in  personnel  flow  models — e.g., 
loss  rates.  Improved  methods  should  provide  consistent,  interpretable, 
and  parsimonious  sets  of  parameters  for  estimating  flow  rates,  they 
should  incorporate  time  series  data  (in  order  to  detect  underlying 
trends),  they  should  include  "environmental"  data  such  as  occupational 
categories  and  corresponding  civilian  economic  conditions,  and  they 
should  admit  to  statistical  goodness-of-fit  procedures. 

Finally,  we  recommend  that  recently-developed  retention  decision 
models  (for  the  Air  Force  Officer  Corps)  be  revised  and  extended  to 
predict  how  the  flow  behaviors  for  the  various  categories  of  enlisted 
personnel  will  change  if  management  "control"  policies  such  as  compen¬ 
sation,  promotion  opportunity,  educational  benefits,  or  retirement 
programs  are  changed. 
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1 .  I NTRUDUCT 1  ON 

Among  the  primary  models  anployed  in  planning  and  programming 
for  the  enlisted  component  of  the  Air  Force  work  force,  manv  are 
deterministic  models  of  personnel  flow.  These  models  are  part  of  the 
TOPUAP  System  (Total  Objective  Plan  for  Career  Airman  Personnel)  and 
include,  tor  example,  the  Objective  Force  Model  ("OBKOR"),  the  Ai  m:an 
Force  Steady  St;ite  Model  ("the  Static  Model"),  the  Promotion  Fl<'w 
Model  ("the  Dynamic  Model"),  the  Five  Level  Redistribution  Program. 
("FLRP")  ,  tlie  Career  Progression  Croup  Model  ("CPC  Static"),  and 
the  Airman  Skill  Force  Model  ("ASKIF  11").  Several  of  these  models 
are  stitic  (i.e.,  steady-state);  they  represent  the  work  force 
structure  that  should  eventually  develop  if  management  policies, 
manpower  requirements,  retention  behavior,  upgrade  and  promotion 
rates,  etc.,  remain  unchanged.  In  addition,  practically  all  of  these 
models  treat  only  the  career  portion  of  the  enlisted  work  force — 
those  individuals  serving  beyond  their  initial  enlisted  term  of 
service  (which  usually  lasts  four  years). 

In  discussing  possibilities  for  the  form  and  structure  of 
extensions  to  the  capabilities  represented  in  the  TOPCAP  models, 
especially  with  regard  to  improving  analysis  capabilities  focusing 
on  the  first-term  work  force,  analysts  at  Rand  and  in  the  Air  Force 
jointly  agreed  that  uncertainty  must  be  considered  directly.  Many 
of  the  inputs  (e.g.,  retention  rates)  and  virtually  all  of  the  outputs 
of  such  models  (e.g.,  reenlistment  rates  and  annual  recruitment  pro¬ 
jections)  are  subject  to  uncertainty — often  because  they  must  be  esti¬ 
mated  using  sample  data  and/or  because  they  may  depend  on  future 
decisions  made  by  or  about  individual  airmen.  (Depending  on  the  use 
of  the  information,  stay/leave  decisions  made  by  many  thousands  or 
perhaps  only  a  few  airmen  may  be  of  interest.) 

Uncertainty  warrants  special  attention  for  two  primary  reasons, 
one  technical  and  the  other  having  both  technical  and  decisionmaking 
ramifications.  The  first  reason  is  that  personnel  flow  models  often 
anploy  nonlinear  equations  to  relate  random  quantities.  For  example. 


"required"  first-term  reenlistment  rates  are  nonlinear  functions  of 
requirements  for  career  force  entrants,  accession  quantities,  and 
retention  rates.  The  expected  value  of  one  random  variable  in  such 
a  relationship,  unfortunately,  cannot  be  found  by  replacing  the  other 
random  variables  witli  their  expectations  and  solving  the  resulting 
equat ion — at  least  not  in  general.  We  determined  to  address  this 
"nonlinearity  of  expectations"  problem  before  beginning  to  develop 
further  deterministic  models  that  ignore  it.  The  second  reason  for 
examining  uncertainty  relates  to  possible  alternative  forms  for  stat¬ 
ing  decision  criteria  and  for  specifying  corresponding  model  objective 
and/or  constraint  functions.  For  example,  we  might  wish  to  consider 
management  options  only  if  they  provide  confidence  of  certain  events 
occurring — e.g.,  a  90%  chance  that  a  particular  career  progression 
group  (CPG)  would  require  a  reenlistment  rate  no  higher  than  45% 
in  1983.  Or  we  might  want  to  consider  a  policy  change  only 
if  it  is  likely  to  yield  results  subs t an t i a  1 1 v  different  from  the 
current  policy--*?.;;.,  if  A,,  and  A^.  represent  a  particular  (TF's 
accession  requirement  in  1983  under  alternative  policies  X  and  Y, 
respectively,  we  may  want  to  assure  a  90%  chance  that  A^  exceeds  A^, 
by,  say,  5  percentage  points.  Alternatively,  we  may  simply  want  to 
know  the  probability  that  our  expected-value  estimates  will  be  off  by 
particular  amounts — e.g.,  what  is  the  probability  that  our  estimate 
of  the  number  of  people  in  the  fourth  year  of  service  for  1984  is  off 
by  10%  or  more?  Assessment  of  uncertainty  is  obviously  essential  if 
information  of  this  type  is  to  be  provided  to  decisionmakers,  and 
underlying  mathematical  structure  is  clearly  affected  if  "chance 
constraints"  are  to  be  incorporated  in  planning  models. 

Since  these  concerns  about  uncertainty  arose  during  consideration 
of  first-term  planning  models,  much  of  this  Note's  discussion  refers 
to  aspects  ot  the  first-term  work  force.  Nevertheless,  the  concepts, 
recursive  equations,  etc.  can  be  generalized  easilv  to  include  the 
career  work  force  as  well. 

The  next  section  reviews  previous  research  regarding  uncer¬ 
tainty  in  personnel  flow  models.  We  then  address  the  sources 


of  uncertainty  in  personnel  flow  models,  develop  analytical  means 
for  evaluating  the  extent  of  these  uncertainties  using  a  Harkov  model , 
and  present  sample  descriptive  results  based  on  an  implementation  of 
the  model  that  focuses  on  the  first-term  force.  Identifying  appropriate 
categories  of  enlisted  personnel  and  the  flow  rates  among  the  categories 
is  critically  important  in  personnel  flow  models.  Section  4  addresses 
statistical  methods  for  discerning  categories  of  airmen  whose  re¬ 
tention  behaviors  (flow  rates)  differ  and  for  predicting  how  their 
behavior  would  change  under  altered  management  policies  (e.g.,  larger 
reenl istment  bonuses,  revised  compensation  tables,  improved  promotion 
opportunities,  or  altered  retirement  benefits).  We  conclude  with 
recommendations  for  how  uncertainty  should  be  incorporated  in  future 
personnel  flow  models  and  for  statistical/behavioral  modeling  work 
(related  to  personnel  "supply")  that  should  be  accomplished  and 
incorporated  in  improved  Air  Force  personnel  planning  and  programming 
models . 


2.  RELEVANT  PREVIOUS  WORK 


Personnel  flow  models  that  Incorporate  uncertainty  typically 
are  either  Markov  chain  models,  renewal  models,  or  simulation  models, 
Markov  chain  models  are  "push"  models  in  the  sense  that  flow  occurs 
due  to  natural  progression  from  one  state  (e.g.,  pay  grade  or  length 
of  service)  to  the  next,  and  due  to  the  inflicc  of  new  personnel 
into  the  system.  Renewal  models  are  "pull"  models  driven  by  the 
need  to  fill  vacancies.  Simulation  models  can  be  either  push  or 
pull  models  or  combinations  of  the  two. 

Markov  chain  models  assune  that  from  one  observation  to  the 
next  the  net  changes  in  state  exhibit  the  Markov  property.  That  is, 
the  probability  of  changing  from  one  state  to  another  depends  only 
on  the  current  state,  and  not  on  how  that  state  was  reached.  Such 
models  describe  how  changes  (transitions)  occur  between  states  from 
one  time  point  to  the  next  using  transition  probabilities  or  pro¬ 
portions.  They  generally  are  composed  of  three  components  (see, 
for  example,  Bartholomew  and  Forbes  [5]): 

o  A  description  of  how  flows  take  place  within  the  system 
(specified  by  transition  probabilities); 

o  A  description  of  how  attrition  occurs  from  the  system 
(specified  by  attrition  rates); 

o  A  specification  of  the  number  of  recruits  at  each  point  in 
time  and  the  allocation  of  recruits  to  different  states  or 
categories. 

The  two  primary  uses  of  a  Markov  chain  model  are  prediction 
of  future  behavior  of  the  system  (assuming  no  change  in  the  para¬ 
meters)  and  control  of  the  system  through  policy  changes  (e.g., 
by  altering  recruitment,  changing  promotion  rates,  or  expanding 
or  contracting  certain  categories).  Thus  the  problen  of  control 
arises  naturally  as  a  consequence  of  prediction. 

In  the  Markov  chain  model,  the  transition  rates  typically 
are  fixed,  while  the  numbers  of  people  in  different  states  (or 
categories)  change  jver  time.  In  contrast,  the  renewal  model  has 
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fixed  nunbers  of  people  In  the  different  states,  while  the  flows  are 
allowed  to  vary.  A  renewal  system  model  can  possess  awkward  mathe¬ 
matical  features,  but  simple  renewal  systems  can  be  handled  within 
the  Markov  chain  framework  (Bartholomew  [4]).  The  central  assunptions 
of  the  renewal  model  relate  to  wastage  flow  (i.e.,  losses,  or  departures 
from  the  work  force)  and  how  this  flow  depends  on  length  of  service.  Since 
for  the  first-term  enlisted  force  the  state  (category)  sizes  are  not 
fixed,  a  renewal  model  seems  less  appropriate  than  a  Markov  chain 
model.  The  renewal  model  would  be  more  appropriate  for  studying  a 
long-term  system,  such  as  the  career  force,  since  vacancies  play  a 
larger  role  in  career  force  management. 

Renewal  models  seem  inappropriate  in  our  context  for  other 
reasons  as  well: 

o  First-term  airmen  move  lock-step  through  their  first  term 
of  obligation  (their  promotion  and  upgrade  times  vary 
little) ; 

o  They  are  promoted  more  on  the  basis  of  length  of  service 

and  skill  qualification  than  on  the  availability  of  vacancies 
(vacancies  become  more  important  at  the  higher  enlisted  grades 
and  in  officer  grades); 

o  The  primary  Air  Force  policy  controls  applied  to  the  first- 
term  work  force  concern  enlistment  and  reenlistment,  and 
these  depend  mainly  on  length  of  service  and  overall  numer¬ 
ical  requirements  rather  than  on  vacancies; 

o  The  main  vacancy  aspects  of  first-term  work  force  modeling 
concern  the  needs  for  (a)  career  force  entrants  to  sustain 
a  desirable  and  stable  career  work  force  structure  and  (b) 
raw  recruits  to  achieve  an  overall  work  force  of  specified 
size — and  both  of  these  can  be  incorporated,  as  we  shall 
see,  in  a  Markov  model. 

In  the  steady  state,  it  is  difficult  to  distinguish  between 
Markov  chain  and  renewal  models  from  state  size  and  flow  rate  data 
alone.  This  is  because  state  sizes  achieve  equilibrium  in  the 
Markov  chain  model  and  flow  rates  achieve  equilibrium  in  the  renewal 
model.  Thus,  as  described  in  Bartholomew  [4],  in  the  steady  state 
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either  model  performs  equally  well  with  regard  to  expected  values. 

The  distinction  may  become  crucial,  however,  in  modeling  the  transient 
behavior  of  the  system.  Moreover,  the  Markov  chain  model  permits  much 
easier  evaluation  of  standard  deviations  (a  measure  of  uncertainty) 
for  the  random  quantities  of  interest. 

A  third  method  for  incorporating  uncertainty  in  personnel  flow 
modeling  anploys  simulation  models  (or  Monte  Carlo  models,  as  they  are 
widely  termed).  Simulation  models  are  typically  employed  when  computa¬ 
tionally  more  efficient  methods  prove  inadequate  in  representing  the 
details  of  system  operation.  A  recent  example  of  a  simulation 
approach  to  personnel  flow  modeling — indeed  to  Air  Force  work  force 
modeling — is  the  Integrated  Simulation  Evaluation  Model  [14]. 

This  model,  however,  has  as  its  primary  aim  the  prediction  of  central 
tendencies — expected  numbers  of  accessions,  promotions,  transfers, 
etc. — rather  than  of  variations  around  expectations.  Simulation 
modelers  quite  often  ignore  such  uncertainty,  although  there  is  a 
substantial  literature  regarding  variance  reduction  techniques  (see, 
e.g.,  Fishnian  [10]).  In  contrast,  our  aim  is  to  examine  the  extent 
and  sources  of  uncertainty  or  "spread"  that  are  ignored  in  widely- 
employed  deterministic  personnel  flow  models.  Further,  the  relative 
simplicity  of  the  flows  in  typical  personnel  planning  models  makes 
the  computation  expense  of  full-fledged  simulation  techniques  un¬ 
necessary.  As  will  be  seen  later,  however,  we  do  resort  to  simulation 
to  augment  our  analytic  Markov  model  in  one  situation  (in  order  to 
consider  uncertainty  in  estimates  of  the  transition  and  accession 
probabilities)  because  the  analytic  stochastic  model  simply  becomes 
too  complex.  We  find  that  combining  analytic  and  simulation  models 
to  represent  uncertainty  achieves  a  desirable  degree  of  econony  in 
both  analysis  and  computation  Lime. 

The  bulk  of  the  open  literature  on  personnel  flow  models  anploys 
the  Markov  chain  structure  but  ignores  its  implicit  uncertainty. 

Instead,  the  focus  is  on  expected  values,  and  most  treatments  could 
more  accurately  be  termed  deterministic  fractional  flow  models  than 
Markov  chain  models.  C.rinold  and  Marshall  [12],  for  example,  in  a 
longitudinal  comparison  of  two  cohorts  of  U.S.  Marine  Corps  entrants. 
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note  a  "significant  divergence"  in  the  numbers  remaining  after 
several  years.  Probabilistically,  however,  that  divergence  should 
not  be  thought  uncommon — i.e.,  it  could  not  be  characterized  as 
"statistically  significant." 

The  most  notable  publications  regarding  uncertainty  in  personnel 
flow  models  are  by  Bartholomew  (see,  e.g..  Refs.  1  through  6). 

Mainly,  however,  Bartholomew  merely  catalogs  the  various  sources  of 
uncertainty  and  suggests  circimstances  where  they  are  likely  to  be 
important;  he  generally  does  not  show  how  to  evaluate  the  extent 
of  uncertainty.  But  he  does  note  ([5],  p.  110)  that  stochastic  var¬ 
iation  can  be  quite  large  and  its  analysis  difficult: 


the  errors  in  forecasts  are  likely  to  be  quite  large — 
the  variances  of  the  predictions  being  of  the  s^ime  order 
as  the  predicted  values  themselves.  On  top  of  this  there 
is  a  further  source  of  error  arising  from  the  fact  that 
inmost  applications  some,  at  least,  of  the  parameters 
have  to  be  estimated....  This  source  can  give  rise  to 
errors  of  a  similar  magnitude  to  the  random  error 
arising  from  the  stochastic  assumptions  of  the  model. 

This  takes  no  account  of  the  uncertainties  of  yet 
another  kind  arising  from  changes  in  the  parameters 
which  may  occur  during  the  forecast  period.  The 
whole  question  of  how  to  cope  with  uncertainty  in 
manpower  planning  is  a  complex  one.... 


Finally,  we  note  that  previous  work  is  concerned  more  with 
long-term  (steady-state)  than  near-term  (dynamic)  aspects 
of  work  force  modeling.  Air  Force  enlisted  personnel  management 
is  conducted  in  a  notably  dynamic  environment,  however,  so  our 
analysis  examines  the  nature  and  size  of  uncertainty  in  this 
setting . 
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3.  A  RUDIMENTARY  STOCHASTIC  PERSONNEL  FLOW  MODEL 


To  ascertain  the  significance  of  uncertainty  in  nonlinear 
relationships,  its  magnitude,  and  the  contributions  of  its  various 
sources,  we  have  developed  a  basic  Markovian  flow  model  that 
represents  a  simple,  first-term  work  force.  The  sources  of 

uncertainty  considered  (all  of  which  are  ignored  in  current  Air  Force 
personnel  planning  models)  are: 

1.  Attrition  (Retention)  Behavior  —  uncertainty  due  to 

the  fact  that  the  work  force  consists  of  individuals,  and 
the  numbers  of  these  individuals  who  elect  to  leave  the 
service  or  whose  service  the  Air  Force  elects  to  terminate 
cannot  be  known  precisely  in  advance.  To  illustrate,  a 
projected  first-year  loss  rate  really  represents  a 
probability  that  an  individual  recruit,  chosen  at 
random,  will  not  complete  his  or  her  entire  first 
year  of  obligated  service. 

2.  Accession  Mix  —  uncertainty  due  to  the  fact  that  the 
proportion  of  new  recruits  possessing  particular  character¬ 
istics — e.g.,  designated  according  to  educational  background, 
sex,  race,  marital  status,  or  mental  aptitude  (characteristics 
that  may  correlate  with  retention  behavior,  productivity, 
and/or  cost) — cannot  be  known  precisely  in  advance.  For 
example,  a  valuable  input  to  a  first-term  personnel  flow 
model  might  be  the  fraction  of  recruits  having  at  least  a 
high  school  education;  in  fact,  this  fraction  estimates  the 
probability  that  a  new  recruit,  chosen  at  random,  will  have 
completed  high  school. 

3.  Parameters  —  uncertainty  due  to  the  fact  that  the  values 
employed  to  represent  retention  and  accession  probabilities 
are  themselves  only  estimates.  Depending  on  the  size  and 
nature  of  the  historical  data  sample  used  to  estimate 
these  probabilities,  there  may  be  considerable  uncertainty 


regarding  their  actual  values.  Further,  prediction  of 
the  values  of  these  parameters  under  altered  management 
policies  (e.g.,  revised  compensation  patterns  or  pro¬ 
motion  opportunities)  introduces  additional  uncertainty. 

4.  Costs  —  uncertainty  due  to  the  fact  that  expenditures 
to  support  different  individuals  within  the  same  general 
category  may  vary.  For  example,  within  a  particular  year 
of  service  and  within  the  same  occupational  specialty, 
individuals'  pay  and  benefits  may  differ  because  they 
hold  different  pay  grades,  have  different  family  situations, 
experience  different  health  problems,  etc. 

While  there  may  be  numerous  other  sjcf..es  of  uncertainty,  important 
in  some  situations  ''e.g.,  differences  in  job  c, r  task  performance  capa¬ 
bilities  among  apparently  similar  indi'. 'duals) ,  these  are  not  considered 
here. 

We  should  note  early  that  the  model  is  analytically  based.  It 
employs  time-recursive  relationships  among  key  work  force  characteristics; 
individual  behavior  is  not  simulated  in  a  Monte  Carlo  sense.  Rather, 
probabilistic  group  behavior  is  considered.  We  resort  to  simulation 
only  when  necessary — namely  in  addressing  the  contributions  of  uncertainty 
implicit  in  underlying  parameter  estimates. 

Recent  consensus  also  emphasizes  the  importance  of  dynamics  in 
personnel  flow  models.  Most  current  personnel  models  are  static 
rather  than  dynamic;  hence  they  can  yield  only  steady-state  results 
concerning  composition  of  the  work  force,  attrition,  and  associated 
costs.  In  contrast,  dynamic  models  can  provide  these  results  along 
with  information  about  how  long  it  may  take  to  achieve  a  steady-state 
(static)  distribution  and  information  about  the  behavior  of  the  system 
along  the  way.  Our  model  has  a  dynamic  structure,  permitting  us  to 
investigate  how  uncertainty  varies  with  time. 

The  model  developed  here  incorporates  several  simplifying  assump¬ 
tions,  notably: 
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1.  A  Four-Year  Term  of  Service.  Although  airmen  may  enlist  for 
either  four  or  six  years  and  although  other  enlistment  terms  are  cer¬ 
tainly  possible,  we  treat  a  four-year  first-term  enlistment  because 
that  is  currently  the  primary  mode  and  because  our  model  is  designed 
for  exploratory  rather  than  descriptive  use. 

2.  Spec ial ty-Spec if ic  Cat ego r izat ion .  Because  (1)  we  expect  uncer¬ 
tainty  to  be  more  significant  when  smaller  personnel  groups  are  con¬ 
sidered,  (2)  manpower  requirements  typically  are  specified  for 
individual  occupations,  and  CD  it  is  computationally  simpler 

to  ignore  the  crosstraining  and  direct  assignment  channels  which  can 
move  individuals  from  one  occupation  to  another,  we  proceed  as  if 
we  are  considering  only  a  single  occupation.  [Note:  as  structured, 
the  model  certainly  can  he  employed  to  represent  larger  personnel 
aggregations,  but  at  the  expense  of  accuracy  in  representing  individual 
occupat ions . ] 

3.  No  Cross-Flow  Among  Categories.  Again,  although  transitions 
such  as  changes  in  marital  status,  number  of  dependents,  skill  level, 
pay  grade,  etc.,  certainly  occur  for  individuals  within  the  first- 
term  enlisted  work  force,  we  exclude  them  here  in  the  interest  of 
simplicity.  The  model  can  be  extended  to  include  such  transitions 

in  a  fairly  straightforward  manner. 

4.  Fixed  Work  Force  Size.  Because  our  intent  is  to  investigate 
the  uncertainty  inherent  in  personnel  flow  models  and  not  the  un¬ 
certainty  implicit  in  (often  fluctuating)  manpower  requirements,  we 
treat  the  total  size  of  the  first-term  work  force  being  considered 
as  constant.  It  would  be  a  straightforward  extension  to  allow  this 
size  to  vary  over  time  due  to  planned  requirements  changes.  One 
extension  we  do  incorporate  later  is  the  possibility  of  recruiting 
shortfalls;  that  is,  although  the  required  number  of  people  may  not 
change,  the  number  having  the  proper  qualifications  who  can  actually 
be  recruited  may  be  inadequate  to  bring  end-strength  up  to  the 

des bred  level  . 

These  seemingly  restrictive  assumptions  clearly  can  be  relaxed 
if  it  is  desired  to  build  a  model  embodying  more  of  the  detailed 
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reality  of  possible  personnel  flows.  But  we  believe  the  present 
model  can  provide  the  necessary  insight  into  the  magnitude  and 
importance  of  uncertainty  in  personnel  flow  models. 

Here  we  will  focus  on  determining  means  and  variances  for 
important  work  force  characteristics  such  as  accession  quantities, 
year-group  sizes,  required  reenlistment  rates,  and  costs.  We 
focus  on  means  because  of  the  potential  nonlinearity  problem 
and  because  they  are  the  conventional  indicators  of  central 
tendency.  Variances  (and  standard  deviations),  correspondingly, 
are  the  usual  indicators  of  uncertainty  or  "spread,"  and  are 
typically  more  tractable  computationally  than  alternative  measures 
of  uncertainty.  Ideally,  of  course,  we  should  obtain  entire  pro¬ 
bability  distributions  for  the  quantities  of  interest,  but  that 
seems  neither  necessary  nor  practical  for  this  exploratory  analysis. 

The  remainder  of  this  section  describes  the  basic  model 
more  explicitly;  detailed  mathematics  are  relegated  to  Appendix  A. 

We  begin  by  describing  the  model's  basic  inputs:  attrition  rates, 
accession  mix,  etc.,  and  their  probabilistic  interpretations.  Then 
follows  a  brief  discussion  of  the  uncertainty  associated  with 
important  parameters  in  the  model  and  how  it  is  represented.  The 
third  subsection  describes  the  model’s  computed  outputs,  and  the 
last  subsection  presents  example  results  and  relevant  observations 
and  conclusions. 


3«1.  Model  Inputs 

In  this  section  we  give  a  brief  overview  of  the  model  and  describe 
its  basic  inputs.  The  fundamental  model  inputs,  each  described  below, 
are: 

o  Subdivisions  of  the  work  force,  characterized  by  year  of 

service  (YOS)  i  and  category  m. 
o  Work  force  size, 

o  Attrition  rates, 

o  Accession  mix. 


o  Costs. 

o  Planning  horizon. 
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Subdivisions  of  the  Work  Force.  For  simplicity  we  assune  that 
all  airmen  enlist  for  a  four-year  period.  Thus,  for  any  calendar  year 
t,  an  airman  belongs  to  year-of-service  (YDS)  i  where  1  _<  i  <  4  . 

Individual  airmen  can  be  categorized  according  to  any  number  of  char¬ 
acteristics  such  as  education  (e.g.,  high  school  graduate),  race,  marital 
status,  mental  aptitude  test  scores,  pay  grade,  AFSC,  etc.  Transitions 
between  categories  are  not  incorporated  in  the  current  model  or  computer 
program  (although  they  could  be  included  in  a  fairly  straightforward  exten¬ 
sion).  Thus,  for  current  purposes,  individual  characteristics  which  may  changi 
over  time  (e.g.,  skill  level,  pay  grade,  or  marital  status)  should  not 
be  considered  as  category-distinguishing  characteristics.  Within  the 
model,  airmen  in  each  calendar  year  t  are  distinguished  by  their 
YOS  i  (1  _<  i  _<  4)  and  category  m  (1  _<  m  M)  .  Notationally , 
we  let  N^m(t)  =  number  of  airmen  in  YOS  i  and  category  m  ,  for 
calendar  year  t.  This  number  Is  generally  a  random  variable. 

Work  Force  Size.  In  this  model  it  is  assuned  that  the  work  force 

is  kept  constant  at  size  N  .  Thus,  we  assume  initially  that  the  Air 
Force  can  enlist  as  many  airmen  as  necessary  to  keep  its  force  size 
fixed.  N  is  a  variable  whose  value  is  chosen  by  the  decisionmaker. 

For  each  calendar  year  t  ,  we  have 


N 


M 

E 


4 

E 


m*=l  i=l 


Ni»(t) 


We  also  treat  in  our  examples  and  in  Appendix  A  a  case  where  the  work 
force  size  is  a  random  variable — in  particular,  we  admit  the 
possibility  of  recruiting  shortfalls. 

Attrition  Rates.  Attrition  is  treated  by  supposing  that  each 
individual  in  YOS  i  and  category  m  stays  in  the  service  another 
year  with  probability  p^m  (so  that  the  probability  of  attrition 

Air  Force  Specialty  Code,  basically  an  occupational  designator. 
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bet  ween  YOS  1  and  i  +  1  is  1  -  p.  ).  The  individual  stay/leave 

lzn 

decisions,  whether  made  by  the  Air  Force  or  the  airman,  are  assumed 
to  occur  independently  of  one  another.  Thus,  for  a  particular  per¬ 
sonnel  group  (say  YOS  i  and  category  m) ,  the  number  remaining 
from  one  year  to  the  next  has  the  following  conditional  binomial 
probability  distribution: 


P(Ni+l,n,(t  +  »  '  k  I  *  »> 


Accession  Mix.  In  treating  accessions  into  the  first  year  of 
service,  the  model  assumes  that  we  first  observe  the  total  number 
of  people  leaving  the  force  from  each  year  of  service  and  each 
category.  Hence,  the  number  of  people  needed  to  enter  the  first  YOS 
in  calendar  year  t  +  1  to  keep  the  force  size  fixed  at  N  is 


4  H 

L(t  +  1)  -  N  -  E  E  N.  (t  +  1)  . 
j=2  m=l  jm 


Equivalently,  enough  airmen  must  be  enlisted  to  ensure  that 


M 

Z  N  (t  +  1)  =  L(t  +  1)  . 


Now  each  N^m(t  +  1),  1  _<  m  _<  M  is  a  random  variable  whose 
distribution  must  be  determined.  We  consider  two  methods  for 
modeling  these  random  variables : 

o  Fixed  proportion  model.  Here  it  is  assumed  that  +  1) 

is  a  fixed  fraction  of  L(t  +  1)  .  Let  n„  . 

}4  1  M  be 

positive  numbers  such  that  Z  it  =  1  .  (These  are  the 

.  m 
m=l 

accession  mix  parameters).  If  L(t  +  1)  takes  the  value 
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k  ,  this  approach  assumes  N,  (t  +  1)  =  n  k. 

im  m 

o  Multinomial  model.  In  this  perhaps  more  realistic  model,  we 

take  the  parameters  nM  as  representing  probabilities. 

In  particular,  we  assume  that  N,,(t  +  1),...,  N,w(t  +  1) 

11  In 

are  jointly  multinomially  distributed  with  parameters 
L(t  +  1)  and  .  The  conditional  probability 

density  function  is 


P  (Nn(t  +  1)  =  n1,...,  N1M(t  +  1)  =  i^lLCt  +  1)  =  k) 


=  f(n1’---’nM)  nx!  n2!  ...  r^! 


ni  "m 
rwi  •  •  •  7'm 


where  n^,...,  n^  are  nonnegative  integers  such  that 

M 

I  n  =  k  . 
m=l 

Thus,  given  L(t  +  1),  the  variable  N^(t  +  D  is  not  random  using  the 

fixed  proportion  model,  but  it  is  random  using  the  multinomial  model. 

However,  since  L(t  +  1)  itself  is  random,  N,  (t  +  1)  is 

lm 

actually  random  in  both  models. 

Costs .  Since  the  expense  of  maintaining  the  work  force  is  of 

obvious  interest,  cost  values  are  included  as  model  inputs.  A  cost 

C^m(t)  is  associated  with  each  YOS  i  and  category  m  during 

planning  year  t.  C t )  is  the  yearly  cost  for  one  airman  with 

characteristics  (i,  ra)  .  Since  in  fact,  different  airmen  in  the 

same  class  (i,  m)  may  be  compensated  differently,  depending  on 

marital  status,  pay  grade,  etc.,  our  model  treats  C.  (t)  as  a  random 

im 

variable.  The  question  of  uncertainty  in  costs  is  treated  in  greater 
detail  in  the  next  section. 

Planning  Horizon.  Since  the  model  is  dynamic  as  well  as 
stochastic,  the  length  of  the  planning  horizon  is  an  input  value. 
Thus,  the  model  may  be  used  to  answer  questions  concerning  enlistment 
quantities,  reenlistment  rates,  costs,  etc.,  five  years  from  now, 
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ten  vears  from  now,  etc.  ll  no  policy  or  behavioral  changes  occurred, 
the  long-run  solutions  would  eventually  converge  t.o  the  steady-state 
(equil  ihr  i  tiu  )  answers. 


1  .2 .  Treatment  of  Uncertainty  in  Input  Parameters 

We  now  address  the  question  of  modeling  the  uncertainty  in  several 

of  the  parameters  just  mentioned:  the  retention  rates  (the  p  '  s)  . 

- — - - - lm 

accession  mix  (the  ••  '  s  t  and  costs  (the  C.  (t)'s).  The  values  of 

- - - - — - -  m  -  in 

these  parameters  are  uncertain  because  they  can  be  estimated  onlv  from 

historical  data,  and  historical  estimates  themselves  possess  some  innate 

variability.  One  method  ot  treating  uncertainty  in  these  parameters  is 

explained  more  fully  in  Appendix  A.  There  we  hypothesize  that  the 

parameter  estimates  are  based  on  one  year's  observation  of  a  work  force 

of  size  N,  the  same  as  we  assume  for  future  work  force  sizes.  (This  is 

convenient  for  exploratory  computational  purposes  and  is  consistent 

with  Air  Force  use  of  the  most  recent  year's  data  for  estimating  future 

retention  rates,  etc.)  We  denote  the  number  of  people  from  this 

historical  data  set  in  vear  of  service  i  and  categorv  m  as  n.  .  The 

lm 

estimate  6.  of  the  retention  probability  p.  has  a  variance  inversely 
im  K  ■  *im 

proportional  to  n.  .  The  estimate  r  of  the  accession  mix  parameter 
im  m 

n  has  variance  inversely  proportional  to  7.  n,,  . 
m  -ci  k  lk 

For  our  computer  code,  the  estimate  of  p.  has  been  modeled  as 

im 

if  it  had  a  normal  distribution  with  mean  p.  and  variance  p.  (1-p.  ) /n .  . 

im  im  im  im 

Future  refinements  of  the  model  should  consider  anv  dependence  among 
the  estimates  of  the  retention  rates  and  should  model  the  distribution 
of  their  estimates  more  precisely.  (The  normal  approximation  is 
generally  good  for  large  n^,  but  not  for  small  n  .)  The  assumption 
that  the  estimates  of  p^  and  p^„  for  m  t  l  are  independent  is  probably 
fairly  reasonable,  since  events  concerning  individuals  in  different 
categories  are  likely  to  be  independent  of  one  another.  The  assumption 
that  the  estimates  of  and  m  are  independent  is  perhaps  less 

reasonable,  and  this  assumption  merits  further  investigation.  (This 
is  beyond  the  scope  of  this  study,  but  can  be  considered  within  the 
context  of  the  statistical  behavioral  models  discussed  in  Section  4.) 
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The  estimates  of  cannot  be  treated  as  if  thev  were* 

1  M 

independent,  since  ^  +  ...  +  t'm  =  ^  ‘  *or  reasons  described  in 

Appendix  A,  the  distribution  of  the  estimates  of  (t  ^  , .  .  .  ,  !i  )  can  be 
modeled  as  a  Dirichlet  distribution.  In  tliis  case,  the  variance  oi 

the  estimate  of  it  is  inversely  proportional  to  I  and  the  correla¬ 
tion  between  the  estimates  of  it  and  it.,  m  4  l,  is  negative. 

m  it 

There  also  can  be  considerable  uncertainty  in  the  costs  Cim(t) 
associated  with  the  different  classes  (i,  m) .  This  is  because  an 
individual  selected  at  random  from  class  (i,  m)  may  be  payed  con¬ 
siderably  more  (or  less)  than  another  individual  from  the  same  class, 
depending  on  the  pay  grade  each  holds,  the  number  of  dependents  each 
has,  etc.  Our  model  assumes  that  the  variances  Tjjn  im  “  Var  tCim(t)] 
and  the  covariances  ^  =  Cov  [Cim(t),  C  (t)]  have  the  form 

Tim,im  =  8(1)2 
Tim,j*  "  Pg(1)  8(j) 

where  g(-)  is  some  suitably  chosen  positive  function,  and  0  _<  p  _<  1 
Thus  the  correlation  between  two  classes  (i,m)  and  (j,S.)  is  p  . 

If  uncertainty  in  cost  turns  out  to  be  important,  more  work  will  be 
required  to  determine  a  more  realistic  formulation  for  the  variances 
and  covariances  of  its  components. 

The  random  variables  X ],...,  have  a  Dirichlet  distribution 
with  parameters  it]  >  0,...,  ^  0  if  their  joint  density  has  the 

form 

r(ai  +  +  c^)  ari  Vi 

(xi”"’  XM)  "  r(ai)  ...  r(aM)xi  *M 


M 

r(.)  is  the  gamma  function,  each  x  >0,  and  Z  x  =  1  . 

m  ,  m 

m=l 


where 
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3.3.  Computational  Outputs 


The  primary  computational  outputs  ot  our  model  are: 


1.  The  means  (expected  values)  and  covariances  of  the  numbers 
of  individuals  in  the  various  classes  (i,  m)  for  each 

year  t  (these  include  accession  quantities);  i.e.,  E(N^(t)) 

and  Co v  [ N .  ( t ) ,  N .  ,  ( t )  ]  . 

lm  j 

2.  The  mean  value  and  standard  deviation  of  the  number  of 
accessions  for  each  year  t  ;  i.e.. 


3.  The  mean  value  and  standard  deviation  of  the  required 
overall  reenlistment  rate  for  each  year  t.  We  shall 
assume  that  the  desired  reenlistment  quantity,  say  R,  is 
known.  This  value  may  represent  the  number  of  people 
required  to  enter  the  career  portion  of  the  work  force  in 
order  to  achieve  some  personnel  structure  there  (e.g.,  the 
"objective"  work  force  structure  identified  using  OBFOR  for 
a  particular  CPC).  For  convenience,  we  take  the  reenlist¬ 
ment  target  R  as  proportional  to  the  fixed  first-term  force 

size  N;  i.e.,  we  use  R  =  cN  or  R  =  c  N,  respectively, 

m  m 

depending  on  whether  we  are  considering  an  overall  or  a 

category-specific  reenlistment  rate.  Consequently  the 

required  reenlistment  rate  is  defined  as  the  quotient  of 

the  reenlistment  target  R  (fixed)  and  the  total  size  of  the 

M 

fourth-year  group  (random),  i.e.,  u)  =  cN/  I,  N.  (t),  and 

t  m=l  4m 

we  wish  to  compute  the  mean  Ew  and  standard  deviation 

1/2  c 

{Var  a)  }  of  the  required  reenlistment  rate. 

f 

For  a  random  variable  X,  we  will  use  both  EX  and  E(X)  for  the 
mean  of  X,  and  both  Var  X  and  Var(X)  for  its  variance. 
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4.  The  mean  value  and  standard  deviation  of  the  reenlistment 
rate  for  each  category  m,  for  each  year  t;  i.e., 

E  w  and  {Var  (u  ^  , 

mt  mt 


where 


U) 


mt 


c 

m 


5.  The  mean  value  and  standard  deviation  of  the  cost  of  the 
first-term  force  for  each  year  t  ;  i.e., 

1  /? 

E  C(t)  and  {Var  (C(t) ) }  7  , 


where 


C(t) 


4  M 
I  E 
i=l  m=l 


C. 

im 


(t)  N 


im 


(t)  . 


The  model  can  evaluate  these  quantities  for  the  cases  where  the 

p.  's  are  fixed  or  random,  the  v  's  are  fixed  (both  the  fixed 
im  m 

proportion  model  and  the  multinomial  model)  or  random,  and  the  costs 

C.  (t)  are  fixed  or  random, 
im 


3.4.  Results  and  Observations 

In  this  section  we  present  results,  conclusions,  and  recommen¬ 
dations  reached  from  exercising  a  computerized  version  of  this  model — 
using  a  range  of  assunptions  and  input  parameters. 

3.4.1,  Parameter  Inputs 

For  exploratory  purposes,  we  use  total  force  sizes  of  N  =  100, 

500  and  1,000  and  a  planning  horizon  T  of  10  years.  For  illustration, 
we  assune  that  personnel  are  classified  according  to  two  characteristics 
(e.g.,  A  =  educational  background  and  B  =  sex),  each  characteristic 
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having  two  levels  or  values  (e.g.,  nongraduate  or  high-school  graduate 
and  male  or  female,  respectively).  Thus  personnel  are  subdivided  into 
M=4  categories  or  cells.  Table  1  presents  the  list  of  accession  mix 
parameters. 


Table  1 


ACCESSION  MIX  PARAMETERS 


A1 

0.40 

0.05 

(v 

(V 

A  , 

0.45 

0.  10 

(  ,> 

<v 

Thus  =  0.40,  =  0.05,  =  0.45,  =  0.10 — i.e.,  10%  of  new 

accessions  have  the  second  "value"  for  characteristic  A  and  the 
second  "value"  for  characteristic  B.  (Throughout ,  the  data  we 
employ  are  only  indicative.  The  intent  is  to  explore  the  potential 
importance  of  uncertainty,  not  to  measure  it  precisely.) 

The  retention  probabilities  p.  are  listed  in  Table  2.  To 

lm 

illustrate,  p^  =  -95  reflects  an  estimate  that  an  average  of  95% 
of  the  personnel  in  Category  2  who  complete  3  years  of  service  will 
complete  their  4th  year  of  service. 


Table  2 

RETENTION  PROBABILITIES 


Cat ogory 


YOS 

l _ 

> 

_  3 

4 

1 

0.900 

0.920 

0.880 

0.841' 

2 

0.920 

0.950 

0.910 

0.880 

3 

0.940 

0.940 

0.  940 

0.900 
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As  explained  earlier  (Section  3.2)  when  the  accession  mix 
parameters  and  the  retention  probabilities  are  treated  as  random 
quantities  (assumed  to  be  estimated  from  historical  data),  their 
variances  are  assumed  to  be  proportional  to  1/(Z  n^)  and  l/n^m> 
respectively,  where  n^  is  t^e  number  of  people  in  YOS  1  and  category  m 
in  the  historical  sample. 

Specifically,  recall  that  the  retention  probability  for  the 

i^  YOS  and  cell  m  is  modeled  as  a  normal  random  variable  with 

given  mean  p  and  variance  p^^  (1  -  p^)  /  n^,  and  that  the 

accession  parameters  are  modeled  as  a  Dirichlet  distribution,  so 

that  the  accession  parameter  for  cell  m  has  mean  *  and  variance 

m 

i!  (J-t  )  /  ('  n,  +  !)•  Since  (for  realism)  we  wish  n .  to 
m  m  k  1  k  lm 

varv  proportionately  to  the  work  turce  .size  N  (and  for  convenience 

and  ease  of  computation),  we  introduce  proportionality  parameters  , 

duf ined  as  y,  *  n.  /N,  or  n.  =  '»  .  N. 

im  lm  lm  tm 

Tlie  fixed  values  of  v  .  are  displayed  in  Table  3.  Thus,  for 

im 

example,  if  N  =  100,  the  number  of  people  t,  in  the  historical 

data  set)  in  the  second  YOS  and  third  category  is  about  12  (so  the 
variance  of  P2j  is  proportional  to  1/12).  For  N  =  1,000,  this  number 
is  about  124  (so  the  variance  of  p^  is  proportional  to  1/124). 

We  have  characterized  the  uncertainty  in  the  accession  mix  param¬ 
eters  and  the  retention  rate  estimates  as  depending  on  N,  the  size  of 


Table  3 

PROPORT  I0NAI.1TY  PARAMKTKRS 

Category 


YOS 

1 

2 

3 

4 

1 

0. 1290 

0.0161 

0.  1432 

0.032  3 

2 

0.0903 

0.0181 

0.  1239 

0.0238 

3 

0. 1016 

0.0068 

0.0903 

0.0271 

4 

0.0890 

0.013s 

0.0733 

0.0133 
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the  work  force  being  considered.  For  convenience,  the  initial  work  force 

structure  N.  (0)  is  also  directly  proportional  to  N.  To  begin  the  compu- 
im 

tation  we  must  input  the  initial  force  f  N.  (0)  }.  We  set  N.  (0)  =  N<$ .  , 

lm  lm  lm 

where  the  relative  mix  parameters  are  listed  in  Table  A.  For  ex¬ 
ample,  if  N  =  1,000,  then  initially  there  are  100  people  in  Category  3 
in  their  second  year  of  service. 

Table  4 

RELATIVE  MIX  PARAMETERS 

Category 


YOS 

_ l _ 

2 

3 

A 

1 

0.  130 

0.020 

0.120 

0.0A0 

2 

0.090 

0.030 

0.100 

0.030 

3 

0. 100 

0.020 

0.080 

0.0A0 

A 

0.080 

0.030 

0.070 

0.020 

The  expected  costs  EC^  are  listed  in  Table  5. 

Table  5 

EXPECTED  COSTS 

Category 


YOS 

l 

2 

3 

A 

1 

o 

o 

7,200 

7 ,  A00 

7,300 

2 

8,200 

8,000 

8,200 

8,000 

3 

9, 100 

8,900 

9,100 

8,900 

A 

9,600 

9,300 

9,600 

9,300 

Here  we  assune,  only  for  simplicity,  that  costs  do  not  depend  on  time  t. 
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The  standard  deviations  of  the  costs  for  each  category 
are  displayed  in  Table  6. 

Table  6 

STANDARD  DEVIATIONS  OF  COSTS 

Standard  Deviation 

600 
550 
700 
800 

To  obtain  the  covariances  between  costs  for  different  categories  and 
years  of  service  as  described  in  Section  3.2,  we  set  ,  =  0.80. 

Using  these  example  data,  we  turn  our  attention  to  our  two 
primary  concerns,  the  "nonlinearity  of  expectations"  problem  and  the 
associated  uncertainty  in  computed  estimates  of  (random)  quantities. 

In  our  analysis  we  examine  five  different  cases  corresponding  to 
increasing  levels  of  uncertainty  in  the  model: 

Case  0:  The  completely  deterministic  case  with  no  uncertain 
parameters,  no  uncertainty  in  stav/leave  decisions, 
and  no  uncertainty  in  the  distribution  of  accessions 
among  the  m  categories. 

Case  1:  Uncertainty  in  stav/leave  decisions  (character iEed  by 

known  p's),  and  proportional  accessions  with  known  r' s. 

Case  2:  Same  as  Case  1,  but  assuming  multinomial  accessions  with 
known  it's. 

Case  3:  Same  as  Case  2,  hut  with  estimated  (random)  p's  and 
it’ s . 

Case  4:  Same  as  Case  3,  hut  with  random  L*  (accession  shortfalls 
are  allowed) . 


Y0S 

i 

1 

n 

3 

4 
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In  Case  4,  the  actual  number  of  accessions  L  is  assumed  to  be 
a  random  variable  with  mean  u  L  and  variance  fci  L,  where  L  is  the 
required  number  of  accessions  to  keep  the  total  force  size  fixed  at 
N  (see  Appendix  A  for  details) .  +  In  the  limit  (as  the  time  horizon 
grows)  the  expected  value  of  the  total  force  size  is  slightly  higher 
than  ..  N.  In  our  analysis  we  chose  a  =  b  =  0.90. 


3.4.2.  Nonlinearity  in  Means 

Current  personnel  force  flow  models  typically  involve  nonlinear 
equations  that  relate  descriptive  random  variables  (such  as 
accessions,  attrition  rates,  reenlistments  and  costs),  but  use  only 
mean  values  of  these  random  variables  in  computations  and  completely 
ignore  the  associated  uncertainties  in  the  results.  One  of  the 
primary  goals  of  our  research  lias  been  to  determine  the  effects  of 
uncertainties  in  these  variables  on  the  computed  outputs  of  such 
models . 


We  have  found  that  for  the  limited  (but  representative)  parameter 
values  we  have  employed,  the  "nonl ineari t v  in  expectations"  problem 
is  not  serious.  The  computed  means  vary  little  among  the  four  cases 
examined.  It  can  he  seen  from  the  equations  in  Appendix  A  that  except 
for  reenlistment  rates,  there  is  no  difference  in  means  between  cases 
l  and  2.  (Required  reenlistment  rates  are  an  exceptional  case  since 
they  are  inverses  of  random  quantities.  Keen) istment  rates  will  be 
discussed  in  greater  detail  below.)  The  reason  is  that  the  computations 
of  the  mean  involve  the  same  values  of  tt  in  the  fixed-proportion  and 
multinomial  cases  and  do  not  involve  the  variances.  Case  0,  the  deter¬ 
ministic  case,  will  have  the  same  mean  as  cases  1  and  2  (again,  except 
lor  reenlistments).  Moreover,  it  can  be  shown  1 rom  results  in  Appendix 
A  that  the  fourth  case  is  "linear" — that  is,  any  change  in  the  mean  is  due 
to  the  "shortfall"  parameter  a  and  not  to  a  nonlinear  equation.  Thus 
only  in  case  3  (randomness  In  retention  and  mix  parameters)  is  there  a 


t- 

We  assume  that  Oil.  _  L.  Further,  as  can  be  seen  t rom  the  equations 
in  Appendix  A,  the  results  onl>  depend  on  the  mean  and  variance  ot  1.  , 
and  not  on  the  form  of  its  probability  density  function. 


✓ 
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"nonl inearity  in  expectations"  problem.  But  for  accessions, 

year-group  size  and  costs,  the  means  vary  little  between  Case  2  and 

case  3.  As  can  be  seen  from  the  tables  in  Appendix  B,  the  difference 

in  means  (for  N  =  1,000)  is  generally  much  less  than  1%.  Even  for  N  =  100, 

the  difference  in  means  for  accessions  between  case  2  and  case  3  is 

at  most  2%-4%.  For  fourth-year  group  size  (still  N=100),  the  difference 

for  the  total  group  and  also  the  large  cell  is  2%-4%;  for  the  small 

cell  it  is  at  most  16%.  However,  since  the  expected  cell  size  is  only 

about  1.2,  this  16%  difference  is  uninteresting.  For  costs,  the  difference 

is  generally  0.2%  or  less.  It  can  happen  for  these  three  quantities 

that  the  mean  may  either  increase  or  decrease  between  case  2  and  case  3. 

As  is  expected,  there  is  always  a  decrease  in  mean  between  case  3  and 
case  4. 

It  is  also  of  interest  to  examine  the  quotients  of  computed 
means  for  the  different  work  force  sizes — e.g.,  comparing  the 
accessions  mean  for  N  =  1,000  with  that  for  N  =  100.  For  all  computed 
quantities  (accessions,  fourth-year-group  size,  and  cost)  we  found 
that  the  ratio  of  the  computed  mean  for  N  =  1,000  to  that  for  N  =  100 
was  always  10/1  for  case  1  and  case  2  (as  is  expected).  However, 
for  cases  3  and  4,  the  ratio  is  not  generally  10/1,  but  may  vary 
from  9.75  (accessions  into  a  large  cell)  to  11.82  (for  the 
fourth-year  group  in  a  small  cell).  This  is  due  to  the  fact  that 
the  variances  of  the  p's  and  tt's  are  taken  as  inversely  proportional 
to  the  work  force  size  and  not  due  to  any  underlying  nonlinearity  of 
expectations . 

The  mean  required  reenlistment  rate  is  the  expectation  of  the  inverse 
of  a  random  variable.  The  function  1/x  is  more  curved  when  x  is 
closer  to  zero.  Thus  if  the  mean  of  the  four th-year-group 
size  is  small,  we  could  expect  the  mean  required  reenlistment 
rate  to  be  affected  by  the  "nonlinearity  problem."  But  t  tie 
variance  of  the  fourth-year-group  size  also  is  potentially  important. 

The  mean  reenlistment  rates  can  be  approximated  by  a  Taylor  series 
expansion  of  the  function  1/x.  As  shewn  in  Appendix  A,  the 
approximation  involves  the  use  of  both  the  mean  and  variance  of  the 
fourth-year-group  size  (either  in  total  or  by  cell).  Since  the 
variance  of  the  fourth-year-group  size  increases  over  all  four 
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cases  (as  described  below) ,  and  since  the  fourth  year-group  mean 
decreases  between  case  3  and  case  4,  the  mean  required  reenlistment  rate 
steadily  increases  from  case  1  to  case  4.  However,  this  increase 
in  mean  is  so  slight  as  to  be  almost  negligible,  with  the  exceptions 
of  case  4  and/or  small  cell  size. 

For  N  =  1,000,  the  change  in  mean  reenlistment  rate  for  the  total 
group  and  for  the  large  cell  is  at  most  2%  across  the  5  cases;  for  the 
small  cell  it  is  at  most  14%.  For  N  =  100,  the  total  reenlistment 
rate  mean  varies  at  most  3%;  for  the  large  cell  it  varies  at  most  10%; 
the  computed  mean  reenlistment  rate  for  the  small  cell  exceeds  1, 
because  of  the  Taylor  approximation,  and  hence  is  not  meaningful. 

In  summary,  our  analysis  shows  that  the  "nonlinearity  in  means" 
question  is  generally  no  real  problem,  and  that  using  expected  values 
in  nonlinear  equations  to  relate  various  quantities  such  as  accession 
quantities,  reenlistment  rates,  fourth-year-group  size  and  cost  is 
a  safe  practice,  except  when  N  (and  consequently  some  expected  cell 
sizes)  is  small. 

3.4.3.  Incraase  in  Uncertainty 

Although  the  degree  of  nonlinearity  now  appears  of  little 
practical  concern,  the  magnitude  of  uncertainty  can  be  substantial — at 
least  warranting  explicit  consideration.  As  shown  in  the  next  sec¬ 
tion,  actual  values  of  various  random  quantities  of  interest  can 
be  quite  different  from  their  expected  values.  This  undercuts  con¬ 
fidence  ir.  mean-value  predictions  of  accession  quantities,  reeniist- 
ment  rates,  etc. 

To  assess  the  magnitude  of  uncertainty  associated  with  our 
outputs,  we  use  the  coefficient  of  variation  (CV) — the  standard 
deviation  of  a  quantity  divided  by  its  mean.  Notationally ,  this  is 
CV  =  a/p.  Thus  we  may  write  the  mean  "plus  or  minus"  two  standard 
deviations  as  p+2c  =  p(l+2CV) .  As  can  be  seen  from  this  formula, 
all  we  need  do  to  assess  the  magnitude  of  uncertainty  is  to  compare  the 
CV  (or  2 -CV)  with  1. 
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Appendix  B  consolidates  important  outputs  from  nunerous  model 
runs.  For  all  four  stochastic  cases  and  for  each  year  in  the  10-year 
planning  horizon,  the  means,  standard  deviations,  and  coefficients  of 
variation  are  tabulated  for  total  accessions,  four t h- year-group  size, 
required  reenlistment  rate,  and  annual  cost.  For  illustration,  the 
first  three  measures  are  also  tabulated  for  two  subsets  of  the  work 
force:  the  largest  and  smallest  "cells"  (e.g.,  male  high-school 

graduates  and  female  nongraduates,  respectively).  Generally,  the 
CVs  increase  yearly,  they  typically  approach  their  limiting  values 
after  about  four  years  (primarily  because  the  model  assumes  a  four- 
year  term  of  service),  they  increase  from  case  1  to  case  4,  they  are 
substantially  larger  when  N  is  smaller,  and  they  are  larger  for  sub¬ 
sets  of  the  work  force  than  in  total. 

For  total  accessions,  we  found  that  the  CV  increases  slightly 
over  the  first  three  cases,  but  that  for  case  4  it  is  double 
the  value  of  case  1.  For  case  4,  the  CV  (10th  year)  is  0.316 
for  N=10G  and  0.103  for  N=1000.  The  large  increases  are  from  case  0 
to  case  1  (the  simple  addition  of  binomial  choices)  and  from  case  2 
to  case  3  (randomness  in  the  p's  and  tt's).  For  the  large  cell 
(cell  3)  the  CV  doubles  from  case  1  to  case  3  and  increases  moderately 
for  case  4.  All  four  sources  of  uncertainty  contribute  sig¬ 
nificantly.  The  results  for  the  small  cell  (cell  2)  are  even  more 
dramatic.  Here  we  find  that  in  some  cases  we  get  CVs  greater  than 
one,  for  N=100. 

Figure  1  displays,  as  a  function  of  work  force  size,  the  mean  plus 
or  minus  twice  the  standard  deviation  for  total  accessions  and  for 
accessions  into  the  largest  and  smallest  personnel  categories  (cells). 
The  graph  is  based  on  case  3  (uncertainty  in  attrition  behavior, 
accession  mix,  and  in  estimation  of  the  p's  and  it's).  Note  that  uncer¬ 
tainty  is  substantially  larger,  relatively,  for  smaller  subsets  of  the 
work  force. 

For  total  fourth-year-group  size,  the  CV  generally  increases  by  a 
factor  of  2  to  3  from  year  1  to  year  10.  It  increases  slightly  over 
the  first  three  cases,  then  nearly  doubles  for  case  4.  For  case  4, 
the  CV  ranges  from  0.  192  for  N  =  100  to  0.104  lor  N  =  1000.  The  large 


Fig.  1 — Expectations  and  uncertainties  in  accession  quantities 
(fifth  year  in  the  planning  horizon) 
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increases  are  from  case  0  to  case  1  and  from  case  3  to  case  4.  The 
CV  for  the  fourth— year  group  in  cell  3  (the  large  cell)  increases 
substantially  over  all  four  cases.  !t  increases  by  a  tact  or  or  more 
than  2  between  case  1  and  case  4.  In  case  4  the  CV  ranges  iron 
0.460  lor  N  =  100  to  0.14H  for  N  =  1000.  For  the  sn.il  1  el  !  the  t  V 
increases  by  a  factor  of  three  from  case  1  to  case  4.  It  i  •-  gr.-atet 
than  1  (for  N=100)  for  both  cases  J  and  4. 

For  total  required  reenlistment  rates,  the  CV  increases  only 
slightly  for  the  first  three  cases,  then  nearly  doubles  for  the  fourth 
case.  In  case  4,  the  CV  varies  from  0.297  for  N  *=  100  to  0.103  for 
N  =  1000.  Here,  again,  the  large  increases  are  due  to  case  1  and  case  4. 

For  the  large  cell,  each  case  contributes  significantly  to  the  increase  in 
CV.  For  the  fourth  case,  CV  ranges  in  value  from  0.379  for  N  =  100  to 
0.145  for  N  «  1000.  For  the  small  cell,  the  CV  again  increases  signifi¬ 
cantly  for  each  case.  For  case  1,  the  CV  is  0.346  at  N  =  100  and  is 
0.125  at  N  =  1000.  For  Case  4,  it  is  0.482  and  0.344,  respectively. 

Figure  2  displays  a  confidence  band  for  the  overall  required  reenlistment 
rate  as  a  function  of  the  work  force  size  N  for  the  fifth  year  in  the 
planning  horizon  (again  for  case  3),  Recall  that  this  rate  is  uncer¬ 
tain  because  it  depends  on  the  random  number  of  people  who  actually 
complete  four  years  of  service;  the  desired  (required)  reenlistment 
quantity  is  held  fixed.  Note  th<v_  the  confidence  band  widens  con¬ 
siderably  when  a  small  work  force  is  considered  but  remains  fairly 
stable  even  for  a  fairly  large  first-term  work  force.  (Recall  again 
that  these  work  force  sizes  are  representative  for  many  Air  Force 
occupational  specialties.) 

As  may  be  seen  from  Appendix  B,  the  coefficient  of  variation  for 

the  overall  total  cost  was  only  about  7%  (0.07).  Since  the  input 

standard  deviations  of  the  individual  costs  C.  (t)  were  on  the  order 

lin 

of  about  7%  of  the  means  (the  C^'s),  this  would  indicate  that  the 
cost  results  are  dominated  by  these  inputs.  To  verify  this,  we 
doubled  the  input  standard  deviations  of  cost  and  found  that  the  CV 
of  the  overall  total  cost  approximately  doubled  to  0.14.  Thus,  cost 
uncertainties  are  sensitive  to  the  ascribed  values  of  these  standard 


Reenlirtment  rate  required  to  enter  N  people  into  the  career  work  force 


-29- 


Fig.  2  -  A  confidence  band  for  required  reenlistment  rate 
(fifth  year  in  the  planning  horizon.  Case  3) 
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deviatlons,  and,  consequently,  their  accurate  prediction  requires 
good  estimates  of  these  standard  deviations.  Such  estimates  could  be 
obtained  in  practice  by  using  random  samples  from  the  Uniform  Airman 
Record  (UAR) . 

Our  findings  regarding  the  increase  in  uncertainty  due  to  the 
successive  sources  in  cases  1-4  are  typified  graphically  in  Fig.  3. 

This  display  shows  the  coefficient  of  variation  in  each  case  for 
total  accessions,  required  reenlistment  rate,  and  cost.  Uncertainty 
increases  most  notably  with  smaller  work  force  sizes  and  with  the 
assumption  in  case  4  of  enlistment  shortfalls. 

Finally,  Figs.  4  and  5  reflect  the  dynamics  involved  in  our  model. 
Figure  4  is  a  plot  of  total  accessions  versus  time  for  the  ten-year 
horizon,  and  Fig.  5  is  a  plot  of  total  required  reenlistment  rate 
versus  time.  Again,  both  figures  are  based  on  case  3.  The  variation 
over  time  apparent  in  these  graphs  is  due  to  the  fact  that  the  initial 
work  force  configuration  is  not  the  mean  equilibrium  configuration. 

This  "zigzagging"  dampens  over  time,  and  the  expectations  of  these 
quantities  eventually  would  converge  to  stable  equilibrium  values. 

3.4,4,  Assessment  of  Risk 

These  results  indicate  that  uncertainties  in  projecting  the  values 
of  several  work  force  characteristics  can  be  substantial.  We  are 
naturally  led  to  wonder  about  the  probabilities  of  certain  events 
occurring.  For  example,  what  is  the  probability  that  more  than  X 
people  will  have  to  be  recruited  in  1984  in  order  to  maintain  a  first- 
term  work  force  of  a  specified  size,  or  what  is  the  probability  that  a 
reenlistment  rate  higher  than  40%  will  be  required  in  1987  in  order 
to  enter  Y  people  into  an  occupation's  career  work  force?  Using 
the  means  and  variances  identified  with  our  stochastic  flow  model,  we 
can  approximate  these  probabilities.  We  do  this  by  using  the  method 
of  moments  to  estimate  the  parameters  of  probability  distributions 
that  approximate  those  of  the  subject  random  variables.  For  example, 
since  N^(t)  =  £  N^Ct),  the  number  of  people  in  the  fourth  year  of 
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Fig.  3— Coefficients  of  variation  (fifth  year) 


Number  of  accessions 


service  in  year  t,  is  determined  as  the  result  of  a  large  lumber 
of  decisions  by  or  about  individual  airmen,  we  can  expect  that  it 
follows  a  normal  probability  distribution  approximately.  As  an 
illustration,  consider  N^(t)  in  our  example  case  with  N  =  100, 
multinomial  accessions,  and  uncertainty  in  the  estimates  of  r: 

m 

and  p.  (i.e.,  case  1).  In  this  case  K(N  (5))  =  21.8  and 
lm  4 

»Var[N^(5)]  =  1.1.  If  we  assune  that  N^(5)  has  a  normal  probability 

distribution,  then  we  can  determine  the  (approximate)  probability 
that,  say,  a  rt'enl Istment  rate  higher  than  40%  is  required  if  9  people 
are  to  enter  the  career  force  in  year  6: 


P(9/N.  (5)  >  .4)  *  P(N. (5)  <  9/. 4  =  22.5) 
4  4 


=  P(Z  <  (22.5  -  21 . 8)/3 . 3  =  .212)  -  0.584  , 


where  Z  is  the  standard  normal  random  variable. 

In  this  example,  suppose  that  a  required  reenlistment  rate  as 
high  as  40%  is  something  to  be  avoided,  for  example,  because  it  may 
require  a  reenlistment  bonus.  In  this  case  (and  in  general)  we  may  ask 
what  is  the  probability  that  our  expected-value  estimates  will  be  off  by 
particular  amounts — e.g.,  what  is  the  probability  that  our  estimate 
of  N^(5)  (E[N^(5)]  is  off  by  10%  or  more?  Figure  b  provides  a 
ready  means  for  determining  such  error  probabilities,  providing  the 
random  variable  of  interest  can  be  assumed  to  follow  a  normal 
distribution  approximately.  In  this  example,  the  coefficient  of 
variation  of  N^(5)  is  about  0.15  (3.3/21.8  -  0.15),  and  the  graph 
indicates  a  probability  of  about  0.48  that  the  actual  value  of 
N^O)  will  differ  from  E[N^(5)]  by  at  least  10%.  (Note:  0.48 
represents  an  interpolation  between  the  CV  =  .10  and  CV  =  .20 
curves  plotted  in  Figure  6.)  Naturally,  the  smaller  the  per¬ 
centage  error  we  consider,  the  higher  its  corresponding  probability. 

But  the  smaller  the  coefficient  of  variation  for  the  subject  random 
variable,  the  lower  the  probability  of  error.  The  dimensionless 
nature  of  the  graph  in  Figure  6  permits  its  use  for  estimating  the 
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Fig.  6 —  Probability  of  prediction  error  using  expected  values 


error  potential  implicit  in  any  of  the  mean  values  calculated  by 
our  flow  model--as  long  as  the  corresponding  random  variables  follow 
approximately  normal  distributions. 

It  would  be  comparatively  easy  to  include  this  kind  of  risk- 
assessment  capability  in  a  post-processor  for  deterministic  personnel 
flow  models.  That  is,  in  addition  to  calculating  the  standard 
deviations  (and  more  accurate  expected  values)  for  work  force 
characteristics,  a  stochastic  post-processor  could  compute  the 
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approximate  probabilities  of  certain  events  and/or  of  actual  results 
differing  from  mean-value  estimates  by  specified  amounts. 

As  indicated  in  this  study's  introduction.  Air  Force  analysts 
and  planners  might  desire  to  operate  personnel  flow  models  in  such 
a  way  that  they  preclude  management  actions  which  admit  unacceptable 
risks.  For  example,  they  may  wish  to  establish  recruiting  levels 
which  give  high  probabilities  that  the  numbers  of  people  subsequently 
available  for  reenlistment  will  be  sufficient  to  make  a  reenlistment 
bonus  unnecessary.  How  difficult  would  it  be  to  construct  flow  models 
that  could  identify  options  providing  protection  from  risk?  To  examine 
the  complexity  involved  in  the  necessary  calculations,  let  us  use  the 
example  mentioned;  that  is,  we  seek  to  determine  some  number,  A,  of 
people  that  should  be  recruited  to  assure  a  probability  of  at  least  b 
that  a  reenlistment  rate  no  higher  than  r  will  be  required  to  reenlist 
c  of  these  people  for  the  career  force.  More  specifically,  we  want  to 
find  the  minimum  value  a  such  that 

P(R  =  c/Y  r  |  A  =  a)  _>  b, 

where  R  is  the  (random)  required  reenlistment  rate  of  interest,  and  Y  is  the 

number  of  people  remaining  after  four  years  of  service  of  those  a  who 

are  recruited  initially.  Since  Y  has  a  conditional  binomial  distribution 

(conditioned  on  a)  in  case  the  first-term  annual  retention  probabilities 

are  known  with  certainty,  then  the  normal  approximation  to  the 

"f* 

binomial  distribution  can  be  employed  to  obtain 


2pc/r  +  z2p(l-p)  +  Y (2pc/r  +  z2p(l-p)]2  -  4  p2c2/r2 


n  * 

2  P 


where  p  =  E  tr  p,  p„  p_  represents  the  probability  that  an  individual 
r  m  nrlm  2m  3m 

recruit  makes  it  through  his  initial  four-year  obligation,  and  z^ 
satisfies  P(Z>z,  )  =  b.  For  example,  if  c  =  9,  p  =  .75,  r  =  .4,  and 

—  D 


^See  Appendix  C  for  the  derivation. 
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b  =  .90,  we  find  a  =  34.33  or,  rounding  up  to  an  integer,  35.  This 
contrasts  with  the  30  people  (determined  as  9/ [  ( . 4)  ( . 75)  ]  =  30) 
wt  would  obtain  from  a  deterministic  treatment  assuming  a 
reenlistment  rate  of  .4  could  be  attained.  Thus,  in  this  case, 
"insurance"  against  having  to  offer  a  reenlistment  bonus  costs 
about  17%  in  added  accessions  and  overall  (expected)  first-term  work, 
force  size  and  costs.  This  example  is  representative  of  a  work  force 
whose  first-term  component  contains  about  100  airmen.  For  a  first- 
term  work  force  of  about  1000  airmen,  we  can  change  c,  say  to  87,  and 
leave  the  other  parameters  fixed.  The  result  is  a  =  302,  represent¬ 
ing  about  4%  "safety  stock"  over  the  291  that  would  be  indicated  by 
deterministic  assumptions. 

The  situation  becomes  somewhat  more  complicated  if,  realistically, 
the  retention  fraction  p  is  not  known  precisely — as  is  the  case  when 

the  7i  '  s  and  p.  's  are  treated  as  random — because  then  the  random 

m  lm 

variable  Y  does  not  have  a  conditional  binomial  distribution.  As 
a  simple  illustration,  suppose  p  assumes  the  value  .75  with 
probability  .50  and  the  values  .60  and  .90  with  equal  probabilities 
of  .25.  In  this  case  the  expected  retention  rate  is  still  .75,  but 
its  standard  deviation  is  about  .11  and  its  coefficient  of  variation 
about  .14.  It  is  fairly  straightforward,  but  tedious,  to  ascertain 
in  this  case  that  a  =  39  if  c  =  9  and  that  a  =  367  if  c  =  87.  In  both 
cases,  additional  accessions  exceed  the  corresponding  deterministically 
determined  quantities  by  over  25%. 

In  reality  the  actual  distribution  of  the  value  of  p  is  very  much 

more  complex  than  the  simple  one  used  here.  In  principle  it  can  be 

determined  for  a  subset  of  the  work  force — e.g.,  a  CPU — by  examining 

the  distributions  of  its  determinants,  a  set  of  i  ’s  ai  i  p.  's.  But 

m  im 

in  practice  this  would  be  very  difficult,  and  t ho  mechanics  ot  an 
algorithm  to  perform  the  kind  of  calculations  accomplished  above  would 
be  quite  complex  in  the  presence  of  an  involved  distribution  for  p. 

Hence,  we  recommend  that  such  capabilities  not  be  attempted  in  personnel 
flow  models.  in  case  "protection"  from  undesirable  events  is  important, 
however,  it  can  be  obtained  by  adding  constraints  to  a  deterministic  (low 
model  run  in  conjunction  with  a  stochaslii  post -processor .  For  example. 
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the  deterministic  flow  model  could  contain  constraints  providing  lower 
limits  for  the  numbers  of  people  of  particular  types  which  should 
be  recruited  for  each  year  in  the  planning  horizon.  If  the  results 
provided  by  the  deterministic  model  do  not  provide  the  desired 
probabilities  for  particular  events — say,  reenlistment  rates  below 
specified  limits — then  the  lower  limits  on  accessions  could  be 
increased  and  the  deterministic  model  rerun.  This  process  could  con¬ 
tinue,  with  constraint  values  being  increased  or  decreased,  until 
acceptable  probabilities  are  achieved.  These  constraint  adjustments 
could  be  made  either  interactively,  with  program  users  observing 
intermediate  results  and  changing  parameter  values,  or  in  logical 
"loops"  which  would  adjust  constraints  using  specific-  .’es  and  re¬ 
run  the  deterministic  model  and  post-processor  until  the  results  meet 
a  priori  specifications.  We  recommend  these  iterative  approaches 
because  they  are  analytically  simple  and  computationally  practical. 
Descriptive  personnel  flow  models  typically  execute  in  very  short 
times,  and  they  can  be  rerun  with  different  constraints  very 
economically.  Incorporating  the  necessary  probabilistic  computa¬ 
tions  in  the  basic  flow  model  itself,  while  possible  in  principle, 
would  add  immensely  to  Its  complexity  and  computation  time  and 
would  make  its  initial  development  and  testing  much  more  difficult  and 
time-consuming. 


A. 


PREDICTING  PERSONNEL  FLOW  RATES 


The  flow  model  described  in  the  previous  section,  and  indeed  each 
of  the  flow  models  in  current  use  by  the  Air  Force,  assumes  that  the 
work  force  is  already  partitioned  into  subsets  or  categories  whose  re¬ 
tention  rates  differ.  Further,  the  means  (and  in  our  stochastic  model, 
the  variances)  of  these  flow  rates  are  critical  inputs  to  the  descrip¬ 
tive  models.  Of  course  there  are  many  reasons  why  it  is  important  to 
distinguish  personnel  categories  in  flow  models  —  e.g.,  behavioral 
differences  (of  primary  interest  is  retention  behavior),  productivity 
differences,  cost  differences,  and  availability  differences  (i.e.,  dif¬ 
ferences  in  the  numbers  of  people  in  the  enlistment-eligible  popula¬ 
tion  with  particular  attributes  and  differences  in  their  propensities 
for  joining  the  Air  Force).  In  this  section  we  focus  on  ways  to  dis¬ 
tinguish  categories  of  people  whose  behaviors  differ  and  to  charac¬ 
terize  those  differences.  These  fundamental  categories  and  associated 
flow  rates  will  continue  to  constitute  critical  inputs  for  personnel 
flow  models,  whether  the  models  are  deterministic  or  stochastic.  In 
either  case,  the  identification  of  the  categories  and  corresponding 
rates  remains  a  statistical  problem.  This  section  begins  with  a 
description  of  the  important  statistical  issues  relative  to  this  problem 
and  proceeds  with  brief  discussions  of  two  complementary  statistical 
modeling  approaches  we  believe  will  provide  the  requisite  capa¬ 
bilities  for  handling  these  issues. 

4,1.  Statistical  Estimation  Issues 

Whatever  the  techniques  used  to  examine  retention  behavior,  we 
believe  they  should  meet  three  important  criteria: 

o  Statistical  accuracy.  They  should  provide  accurate  predictions 
of  retention  rates,  the  precision  implicit  in  their  estimates 
should  be  characterized,  and  they  should  admit  to  convenient 
tests  of  hypotheses  (particularly  goodness-of-f i t  tests), 
o  Logical  consistency.  They  should  provide  interpretable  rela¬ 
tionships  between  variables  which  predict  retention  rates  and 
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the  corresponding  predictions,  and  their  stability  should  be 
assessed  (i.e.,  the  regular  presence  and  importance  of  the 
identified  characteristics  in  predicting  flow  rates). 

o  Environmental  robustness.  They  should  be  able  to  predict  re¬ 
tention  behavior  under  altered  personnel  management  policies 
such  as  revised  compensation  tables,  promotion  opportunities, 
and/or  retirement  programs. 

The  primary  retention  rate  estimation  technique  in  current  use  by 
the  Air  Force  is  the  Automatic  Interaction  Detector  (AID),  a  method 
which  partitions  its  data  sample  iteratively  using  the  explanatory 
characteristic  that  provides  the  maximum  decrease  in  overal 1  mean 
squared  prediction  error.  Its  users  have  found  this  method  usually 
adequate  for  predicting  overall  retention  rates,  but  have  noted  sub¬ 
stantial  errors  when  subsets  of  the  work  force  are  considered.  AID  is 
employed  within  a  special  Air  Force  information  system,  the  Airman 
Loss  Probability  System  (ALPS),  to  provide  flow  rate  estimates  for 
numerous  personnel  planning  and  programming  models.  ALPS  has  the  ca¬ 
pability  to  bypass  the  AID  partitioning/estimating  routines  for  sub¬ 
sets  of  the  work  force,  and  this  is  frequently  done  for  the  first-term 
component  of  the  work  force.  For  this  component  a  simple  set  of  pre¬ 
dictive  characteristics  is  input  to  ALPS  and  flow  rates  are  calculated 
for  the  corresponding  categories.  Another  estimation  procedure  involving 
trend  extrapolation  also  is  used  occasionally  for  first-terra  retention 
prediction. 

These  estimation/prediction  procedures  have  limitations  with 
respect  to  alL  three  statistical  criteria  cited  above.  They  seem 
to  suffer  least  from  lack  of  predictive  accuracy  --  at  least  in 
the  aggregate,  as  already  noted  —  although  problems  in  this  area 
have  led  to  recent  revisions  in  the  way  the  AID-ident i f ied  categories 
and  corresponding  rate  estimates  are  employed.  But  the  system  appar¬ 
ently  has  no  capabilities  for  characterizing  the  precision  implicit  in 
the  rates  it  identifies  or  for  subjecting  them  or  their  underlying 
structure  to  goodness-of-fit  tests,  although  "validation"  runs  listing 
comparisons  between  predicted  and  actual  retention  quantities  are  made 
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regular  ly .  Another  structural  limitation,  at  least  in  the  basic  AID 
logic,  is  an  inability  to  consider  possible  time  trends  in  the  sample 
data.  Regarding  the  logical  consistency  criterion,  we  note  that  the 
categories  of  airmen  identified  by  the  AID  logic  are  not  always  the 
same.  That  is,  some  predictive  characteristics  appear  to  influence 
retention  behavior  more  during  some  time  periods  and  less  during  others. 
In  fact  this  may  be  characteristic  of  an  observation  made  by  Doyle 
and  Fenwick  [9]:  The  sequential  AID  logit  tan  "find"  explanatory 
power  (in  characteristics)  where  it  doesn't  exist,  and  miss  it  where 
it  does.  A  further  logical  shortcoming  of  AID  is  that  it  does  not 
permit  systematic  study  of  possible  interactions  among  predictive  char¬ 
acteristics.  (For  example,  educational  background,  mental  aptitude, 
sex,  and  race  may  interact  in  subtle  ways  which  would  contribute  to 
understanding  retention  behavior  and  possibly  point  toward  use¬ 
ful  personnel  management  policy  revisions.)  Finally,  with  respect  to 
the  environmental  robustness  criterion,  current  methods  provide  no 
real  capability  to  predict  retention  behavior  under  revised  management 
policies . 

In  the  remainder  of  this  section  we  discuss  briefly  two  improve¬ 
ments  which  can  enhance  considerably  the  Air  Force's  capabilities  for 
flow  rate  estimation:  (1)  application  of  log-linear  models  for  be¬ 
havioral  category  identification  and  rate  estimation,  and  (2)  develop¬ 
ment  of  a  sequential  decisionmaking  model  for  prediction  of  flow  rates 
under  altered  management  policies.  As  we  will  see,  use  of  log-linear 
models  should  provide  a  sound  logical  and  statistical  foundation  for 
rate  estimation  in  the  absence  of  policy  change  and  should  identify 
distinct  categories  of  personnel  for  which  the  sequential  decision¬ 
making  models  should  be  employed  separately.  The  sequential  decision- 
making  model  can  be  based  on  the  model  developed  by  Gotz  and  McCall  [IT]' 
for  prediction  of  officer  personnel  retention, 

4.2,  Category  Identification  and  Rate  Estimation  in  the 
Absence  of  Policy  Change 

We  propose  the  use  of  log-linear  models  to  establish  the  rela¬ 
tionship  between  flow  rates — e.g.,  attrition,  extension-of-obl igat ion. 
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and  reenlistment — and  various  explanatory  (predictor''  categorical 
characteristics,  such  as  mental  category,  educational  background, 

training,  job  category,  history  of  experience  in  the  work  force,  etc. 

Our  purpose  is  to  ensure  statistical  soundness  in  inferences  being  drawn 
from  the  available  data  and  to  lay  a  solid  foundation  for  the  development 
of  a  sequential  decisionmaking  model  to  be  used  in  predicting  behavioral 
changes  in  attrition  (and  other)  rates  due  to  possible  changes  in  Air 
Force  policy. 

To  develop  appropriate  structures  tor  flow  models,  we  need  to  know 
what  character ist ics  generally  distinguish  personnel  categories  and  how 
those  character i st ics  interact.  In  our  view  the  most  reasonable  approach 

to  identifying  and  analyzing  these  characteristics  employs  log-linear 
models  for  discrete  multivariate  data.  This  method  is  based  on  sound 
statistical  theory,  its  results  submit  readily  to  tests  of  significance, 
and  it  possesses  a  number  o:  other  advantages  mentioned  in  the  (cl  low¬ 
ing  brUi  discussion  ot  the  ana!’,  sis  approach . 

Typical  1  y ,  in  l  is-  ’itn.il  analysis,  we  have  a  sample  set  ol  size  X.  an, 
each  data  point  falls  within  one  of  several  categories.  For  example,  in 
a  study  of  attrition  rates,  we  may  have  a  group  of  enlisted  airmen  cate¬ 
gorized  according  to  characteristics  such  as  marital  status,  race,  mental 
category,  educational  level,  skill  level,  geographical  origin,  etc.  and 
according  u  whether  they  6tay  in  or  leave  the  service  in  the  observed 
t  ime  pe  r  u  A  . 

i  1  .  :  i  ’  •  .  ...  i'  'st  i  .i,  j.i*  urn  is  >•  l.issi  1  ied  by  the  values  of 

tine*  .i :  t  :  «-f  •  <  i  ■>.  :  •  '  ■  ■  i  r  i  ib  i  is  t  ■  n.iraet  er  ist  ics)  labeled  A,  B,  and 

C.  Assume  that 

A  has  I  levels  (values) 

B  has  J  levels  (values) 

C  has  K  levels  (values) 

For  example,  A  might  represent  retention  behavior  (two  values:  stav 
or  leave)  during  some  t  itr.  per  fed,  R  might  represent  mental  category 
(say,  using  the  four  major  values),  and  C  might  represent  race  (three 
values:  white,  black,  other).  Thus  there  are  2  x  4  x  3  =  24  "elementary" 

cells.  Of  course,  we  may  also  have  variables  D,  E,  F,  ...,  and  so  on. 


General  1 v ,  we  are  restricted  to  those  variables  maintained  in  the  Uniform 
Airman  Record  (VAR)  for  information  on  individuals  in  the  Air  Force. 


Assam' t  hat  the  .lata  set  ol  size  N  represents  ;  iie  out  civ.es  ot 
st  ’  die  a.  i .  e \ i •  e r  its -i’.t  ,  let 

=  probability  of  an  observation  falling  in  cell  (i,  j,  k) 


and 


'  i  ik 


expected  count  for  cell  (i,  j,  k) 


i-1 


1  IK 


•  ivtai  r.iede  ■  is  i  ■  1 )  t  lined  by  writ  in.-  t  :  ,;ur.i  i  log  tr  ! !  n: 

:  -ell,  luts  is  i  l  i  near  •  iu.it  :  •  •  t «  •  :  *. .  r:r.  vh  i  ■'<: 

rep  re.~  -a:  "i  fleet-  ”  tine  to  tiie  characteristics  A,  R .  and  C,  and  to  their 
various  c  >n-.i  in  «t  ions  (i.e.,  their  interaction  effects).  Formally,  by 
an:iLs;v  with  analysis-  ot  variance,  w-  wi  i  t  .  - 


lo?  :”i.iK  '  11  +  U1  ( i)  4  u2 ( j )  +  u  Hk)  +  u  1  2 (  i  j  1  +  U1 3  I  j  1. 1 
4  U23(.jk)  +  U12  t(  i  ik)  ’ 

where  tf >  variables  u  represent  the  linear  contributions  of  the  various 

combinations  of  the  characteristics  A,  B,  and  C  to  the  logarithm  of 

m... ---hence  tin-  name  "  I  .'g- !  i  near  models."' 
i  jk 

Thus,  the  probabilities  of  Interest  are  found  by  computing 


P.- 


i  jk 


exp  I u  +  u 


1  (i) 


+  u 


2(j) 


+  u 


123(ijk) 


)  /N 


and  then  taking  the  proper  summations. 

For  example,  using  the  definitions  of  the  three  categories  given  above 


'The  symbol  +  used  as  a  subscript  denotes  summation  over  all  values 
of  the  corresponding  index. 


-  Prob  (a  person  chosen  at  random  leaves) 


and 


The  u^2  and  other  pairwise  u-terms  are  the  two-factor  effects; 

U12J  is  the  three-way  interaction  term.  If  the  u^,  U13  and  ui23 

terms  are  all  zero,  the  three  variables  are  mutually  independent.  Tf 

=  0  but  the  others  are  not,  we  have  all  two-way  interactions  present 
but  no  three-way  interaction. 

Maximum  likelihood  is  the  method  employed  to  fit  these  models.  In 
some  situations  exact  closed-form  solutions  can  be  obtained.  Generally, 
however,  iterative  proportional  fitting  methods  must  be  employed.  Com¬ 
puter  programs  for  this  purpose  are  available. 

When  the  model  is  saturated  (no  u-terms  are  taken  as  zero) ,  we  have 
as  many  parameters  to  estimate  as  there  are  "elementary  cells";  otherwise, 
we  may  have  far  fewer  parameters  to  estimate.  The  choice  of  variables 
to  be  included  in  log-linear  models  and  the  examination  ol  the  fit  must 
be  made  carefully,  since  in  "near-saturated"  models  there  may  be  many 
u-terms  to  estimate. 

In  the  initial  stages  of  analyses,  it  is  wise  to  fit  only  the  sim¬ 
plest  of  models,  models  with  no  more  than  two-factor  interaction  effects. 
There  are  several  reasons  for  this. 

1.  We  can  obtain  cell  estimates  for  every  cell  in  a  sparse  array : 
fitting  unsaturated  models  gives  estimates  for  elementary  cells  that 
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linvi'  positive  probabilities  but  no  sample  observations.  For  example,  a 
particular  sample  may  have  no  black,  female  airmen  in  mental  category  II 
in  ^  certain  specialty,  and  yet  the  probability  of  such  an  occurrence  may 
not  be  zero. 

1.  Models  with  two-lactor  effects  yield  elementary  cell  estimates 
that  are  more  stable  than  observed  cell  counts.  Successively  higher-order 
terms  can  be  regarded  as  deviations  from  tire  average  value  of  related 
lower-order  terms,  and  so  models  with  the  higher-order  terms  removed  are 
useful  in  describing  the  gross  structure  of  a  data  array.  Such  models 
describe  general  trends  and  hence  can  be  regarded  as  "smoothing"  devices. 

3.  Simple  models  facilitate  the  detection  of  outliers.  The  detec¬ 
tion  of  sporadic  cells  t ha t  are  unduly  large  may  be  of  importance.  For 
example,  it  will  be  desirable  to  determine  which  combinations  of  variable 
categories  give  an  excessive  number  of  leavers  from  the  work  force. 

As  an  example,  one  may  find  that  married  personnel  with  a  good 
educational  background  and  a  high  skill  level  have  a  higher  attrition 
rate  because  of  the  interaction  of  these  characteristics. 

After  initially  fitting  the  model  with  two-factor  interactions  only, 
the  model  can  be  extended  (if  necessary)  to  include  higher-order  inter¬ 
action  terms.  It  is  also  possible  that  some  two-factor  interactions  could 
be  dropped  from  the  model.  We  should  always  seek  to  develop  as  simple  a 
model  as  possible  that  is  still  consistent  with  the  data,  since  generally 
it  is  much  easier  to  interpret  the  parameters  of  a  simple  model  than  of 
a  more  complex  one.  Additionally,  a  model  with  fewer  parameters  may  im¬ 
prove  the  precision  of  the  parameter  estimates. 

Log- Linear  models  have  the  additional  capability  of  using  the  natural 
ordering  of  categories.  In  our  example,  A  and  C  (retention  behavior  and 
race)  are  not  ordered,  whereas  B  (mental  category)  is  naturally  ordered. 

The  natural  ordering  can  be  used  by  assigning  ordered  scores  to  the  various 
levels  (values)  of  the  ordered  categories  (characteristics) .  This  is 
useful  in  reducing  the  number  of  general  (higher-order)  interaction  u-terms 
and  aids  in  developing  understandable,  interpretabl e  and  effective  models. 
For  example,  in  a  two-way  table  with  variables  A  (retention  behavior)  and 
B  (mental  category),  a  general  log-linear  model  would  have  the  form 

2  (.] )  12(ij) 


log  m. .  =  u  +  u1(i) 
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However,  since  B  has  ordered  levels,  it  may  be  preferable  to  examine  the 
model  in  which  scores  v^,  v^,  v^,  are  assigned  to  mental  categories 
I,  IT,  III,  and  IV,  so  that 

log  m.  .  =  u  +  u,  ...  +  u0/.s  +  (v.  -  v)u'  .. 
ij  l(i)  2 ( j )  j  l(i) 


where  v  is  the  average  of  the  v's.  Such  a  model  has  fewer  parameters  to 
estimate,  and  adds  only  a  few  extra  degrees  of  freedom  to  the  no-interaction 
model. 

We  may  also  use  log-linear  models  to  analyze  discrete  multivariate 
data  forming  a  series  through  time — i.e.,  a  Markov  chain.  Wo  may  wish 
to  analyze  trends  in  attrition  rates  as  they  change  over  time.  Log- 
linear  models  can  easily  be  adapted  to  this  type  of  problem,  whereas 
other  modes  of  analysis  (such  as  AID)  do  not  generally  lend  themselves 
to  such  an  investigation. 

Logit  regression  models  are  a  special  type  of  log-linear  model.  By 
treating  certain  marginal  totals  as  fixed,  we  may  rewrite  a  log-linear 
model  for  the  variables  A,  B,  and  C  as 

w  +  v2(j)  +  w3(k)  +  w23(jk)  • 


This  results  from  the  fact  that  the  conditional  probability  of  attrition 
given  characteristics  (j,  k)  can  be  written 


P(1 1 j»  k) 


ml,jk 

m  ., 
+jk 


in  ~  ,1 

p(0|j,  k)  =  1  -  p  Cl  I . j ,  k)  =  — J- 

m+jk 


Hence, 
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and 


log 


/P(i 

Ip(o 


Ug  )  =  log  P&) 

J’k)/  VmOjk  / 


log  m1;jk  -  log  m0jk 


We  may  use  logit  regression  models  where  the  explanatory  variables 
are  continuous  and/or  discrete.  Further,  we  may  relate  attrition  rates 
to  various  economic  variables  such  as  inflation  rates,  the  cost  of  living 
index,  joblessness,  etc.,  to  develop  a  model  of  attrition  rates  dependent 
on  both  personal  variables  (in  the  UAR)  and  economic  indexes  varying  over 
t  ime  . 

Additionally,  it  should  be  noted  that  log- linear  models  and  logit 
regression  models  can  be  extended  so  that  the  dependent  variable  A  is 
multinomial  (or  polytomous) — i.e.,  A  may  have  more  than  two  outcomes, 
such  as  leave,  extend  or  reenlist. 

In  all  the  above  models,  careful  attention  must  be  paid  to  the 
analysis  of  residuals  and  various  "goodness-of-f it"  criteria  to  detect 
any  serious  model  inadequacy.  Well-established  methods  exist  for  ex¬ 
amining  the  fit  of  log- linear  models  to  actual  data.  Alternative 
methods,  like  AID,  generally  ignore  the  question  of  mode]  fit;  they 
provide  simple,  untested  point  estimates. 


A. 3.  Behavioral  Response  to  Policy  Changes 

As  mentioned  above,  log-linear  models  include  logit  regression 
models  as  a  special  case,  and  logit  models  commonly  have  been  used  to 
estimate  stay/ Leave  behavioral  alteration  in  response  to  policy  or 
environmental  change.  However,  logit  models  do  not  implicitly  rep¬ 
resent  decisionmaking  by  individual  airmen. 

Airmen's  decisions  occur  primarily  near  reenlistment  points,  and 
these  decisions  are  subject  to  some  influence  through  personnel  manage¬ 
ment  policies  such  as  bonus  levels  and  promotion  rates.  Gotz  and 
McCall  [11  ]  have  developed  a  sequential  decisionmaking  model  of 


stay/ leave  behavior  for  Air  force  officers  which  otfers  two  ke. 
advantages  over  alternative  retention  rate  estimation  procedures: 

o  History  dependence.  The  dynamic  programming  model  of  C.otz  and 
McCall  demonstrates  that  retention  rates  of  Air  Force  officers 
depend  both  on  prospective  future  financial  returns  to  remaining 
in  the  military  and  on  past  occurrences.  Their  analysis  shows, 
for  example,  that  ordinary  regression  models  can  overpredict 
retention  rates  for  years  beyond  the  offer  of  a  bonus.  These 
models  ignore  the  fact  that  some  individuals  may  have  stayed  in 
service  only  to  obtain  the  bonus;  hence,  their  post-bonus 
retention  rates  should  be  expected  to  be  lower.  The  important 
extension  in  the  Gotz-McCall  model  which  allows  such  behavior 
to  be  predicted  is  explicit  incorporation  of  a  term  representing 
permanent  differences  in  individuals'  tastes  for  the  military. 
(Of  course  the  distribution  of  the  t  istes  must  he  estimated 
empirical lv . ) 

o  Structure  which _ incorporates  management  pol  icies  directly. 

Personnel  policies  affecting  individuals'  income  streams 
(i.e.,  expected  military  versus  civilian  incomes  witli 
differences  depending  on  compensation  tables,  promotion 
opportunities,  retirement  pay,  and  other  financial  benefits) 
are  represented  explicitly  in  the  underlying  sequential 
decisionmaking  model. 

We  believe  this  sequential  modeling  concept  should  be  developed 
further  and  generalized  for  application  to  enlisted  retention  modeling. 

Key  differences  between  the  Gotz-McCall  model  and  the  sequential  mode  1 
for  enlisted  personnel  may  include: 

o  Multiple  years  between  decision  points.  Although  every  year 
of  service  sees  some  airmen  leave  the  Air  Korce--e.g.,  due  to 
health  problems,  personal  emergencies,  or  unsatisfactory- 
behavior  or  job  performance — most  airmen  face  continuation 
decisions  at  four-year  intervals,  the  usual  enlistment  or 
reenlistment  obligation.  This  contrasts  with  an  officer's  more 
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frequent  opportunities  to  make  stay/leave  decisions,  and  it 
necessitates  structural  differences  in  a  sequential  model 
representing  enlisted  personnel  decisionmaking. 

o  Extension  of  obligation  beyond  normal  enlistment  terms.  Near 
the  end  of  an  enlistment  term,  an  airman  judged  by 
the  Air  Force  to  be  acceptable  for  continued  service  has  a 
third  option  in  addition  to  reenlisting  or  separating:  ex¬ 
tension.  That  is,  the  airman  can  extend  his  or  her  current 
term  of  service  somewhat  and  delay  the  stay/leave  decision. 

This  option  needs  explicit  representation  in  a  behavioral 
model  based  on  decisionmaking  timing  and  options  for  enlisted 
personnel . 

o  Transient  differences  in  "taste"  for  the  military.  Enlisted 

people  typically  enter  the  service  much  younger,  less  educated, 
less  experienced,  and  with  less  forethought  than  officer  per¬ 
sonnel.  Hence  they  cannot  be  expected  to  have  as  stable  an 
affinity  for  the  Air  Force.  Their  impressions  of  service  life 
may  be  much  more  influenced  by  their  induction,  training, 
assignment,  and  initial  work  experiences  than  are  those  of 
officers.  The  Air  Force  is  often  the  first  full-time, 
long-term  job  for  enlisted  people,  and  they  really  don't  know 
what  to  expect  either  from  their  employer  or  from  themselves 
in  their  newfound  responsibilities  and  independence.  Thus, 
the  "taste"  term  to  be  employed  in  a  sequential  decisionmaking 
model  of  their  behavior  may  need  to  be  generalized  so  that,  at 
least  during  the  early  years  of  service,  it  can  follow  different 
distributions.  Alternatively,  the  annual  "disturbance"  factor 
represented  in  the  Got^.-McCall  approach  might  be  allowed  to 
play  a  larger  role  until  airmen  have  sufficient  time  and  experi¬ 
ence  to  stabilize  their  impressions  of  military  life. 

o  Crosstraining.  In  contrast  to  officers,  enlisted  personnel 
receive  virtually  all  of  their  job-related  training  directly 
from  the  Air  Force;  essentially  they  are  "given"  occupations 
by  their  employer.  Sometimes,  when  occupations  become  over- 


manned,  a  condition  for  continued  employment  can  be  that  an 
individual  must  change  occupations.  This  usually  necessitates 
a  period  of  c ross training,  whether  formal  or  informal,  and 
can  result  in  changed  working  conditions,  a  different  set  of 
possible  assignment  locations,  altered  advancement  potential, 
different  opportunities  in  civilian  life,  etc.  Such  a  change 
is  usually  more  drastic  and  may  be  less  expected  than  corre¬ 
sponding  changes  experienced  by  officer  personnel;  hence  it 
may  need  explicit  representation  in  an  enlisted  decisionmaking 
mode  L . 

The  kev  linkages  between  the  log-linear  modeling  approach, 
which  Ionises  on  identification  of  categories  of  personnel  whose 
retention  behaviors  differ,  and  the  sequential  decisionmaking 
modeling  approach,  which  focuses  on  how  behavior  will  change  uiul.-i 
altered  management  policies,  are  the  representations  of  tastes  for 
the  military  and  of  transient  disturbances  affecting  continuation 
decisions.  We  expect  that  the  same  general  model  structure  can  permit 
estimation  of  personnel  flow  rate  changes  under  altered  policies 
regardless  of  behavioral  category,  but  the  different  categories  will 
require  different  model  parameters.  Thus,  the  modeling  approaches 
are  complementary:  the  log-linear  model  provides  an  initial 
"filter"  to  -separate  and  identify  behav  iora  1 1  y  distinct  categories 
of  personnel,  and  the  sequential  decisionmaking  model  predicts 
how  each  category's  retention  behavior  will  change  if  management 
policies  are  changed. 


b.  CONCLUSIONS  AND  RECOMMENDATIONS 


We  have  ascertained  that  projections  for  many  work  force  character¬ 
istics  can  incorporate  sizeable  uncertainties:  "two-sigma"  confidence 
intervals  often  contain  values  differing  10%-40%  from  corresponding  ex¬ 
pectations.  Thus,  especially  when  smaller  segments  of  the  work  force 
are  considered,  substantial  deviations  from  expected-value  projections 
should  be  expected  fairly  frequently.  From  our  very  limited  computa¬ 
tional  experience,  it  appears  that  the  largest  contributor  to  this  un¬ 
certainty  is  usually  the  simple  uncertaintv  in  individual  stav/leave 
behavior  (regardless  of  whether  the  individual  or  the  Air  Force  makes 
the  determination).  Another  potentially  large  contributor  is  uncer¬ 
tainty  in  tire  proportion  of  accession  requirements  which  actually  can  he 
met.  Uncertainties  regarding  the  mix  of  people  that  can  be  accessed 
and  regarding  estimates  of  flow  rates,  while  they  can  be  important  in 
projecting  values  for  certain  subsets  of  the  work  force  (e.g.,  the  num¬ 
ber  of  minority,  male,  and  high-school  graduates  in  a  particular  <T<; 
who  wilL  be  eligible  for  reenlistment  live  vears  hence)  ,  appear  to 
contribute  less  to  uncertaintv  in  overall  work  force  characteristics 
(e.g. ,  the  total  number  of  people  in  a  CPC  who  will  he  eligible  for 
reenlistment  five  years  hence). 

While  we  see  that  uncertainties  can  be  substantial,  we  find  that 
one  of  our  original  concerns,  the  "nonlinear i tv  of  expectations"  problem 
is  not  too  important.  That  is,  at  least  for  the  limited  (but  representa 
tive)  parameter  values  we  have  employed,  the  nonlinear  equations  which 
relate  descriptive  random  variables  (the  random  variables  representing 
enlisted  work  force  characteristics  such  as  accessions,  attrition,  reen¬ 
listments,  and  costs)  also  hold  approximately  when  the  random  variables 
are  replaced  by  their  expectations.  Thus  we  can  expect  deterministic- 
personnel  flow  models,  which  typically  employ  the  best  available  esti¬ 
mates  of  expected  values  within  systems  of  nonlinear  equations,  to  vield 
fairly  accurate  predictions  of  the  remaining  expectations.  Hence,  as 
long  as  our  primary  interests  are  in  expected  values,  we  needn't  make 
the  substantial  extra  effort  to  develop  models  which  include  uncertainty 
explicitly . 
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When  our  interests  shift  to  risk  aversion,  things  get  mud)  more 
complicated.  Since  we  have  not  evaluated  the  entire  distribution  of 
the  work  force  "state,"  we  cannot  make  precise  statements  about  the 
probabilities  for  joint  events  (e.g.,  of  needing  reenlistment  rates 
no  higher  than  40%  in  1981-1984)  .  But  this  work  does  enable  us  to 
determine  the  approximate  probabilities  of  individual  events  (e.g., 
the  probability  that  a  reenlistment  rate  no  higher  than  40%  will  be 
required  in  1983  or  the  probability  that  accessions  in  1982  need  not 
exceed  250  people  in  a  particular  CPC) .  These  approximations  are  ob¬ 
tained  by  employing  the  calculated  means  and  variances  of  the  corre¬ 
sponding  random  variables  to  estimate  the  parameters  of  a  specified 
probability  distribution,  usually  the  normal  distribution.  Since  these 
means  and  variances  are  determined  in  the  course  of  the  model's  exe¬ 
cution,  it  is  a  simple  matter  to  determine  the  approximate  probabilities 
upon  run  completion.  But  if  execution  is  to  be  affected  bv  limits  on 
these  probabilities  (e.g.,  access  enough  people  in  1982  to  provide  90% 
confidence  that  in  1986  a  reenlistment  rate  no  higher  than,  say,  40% 
will  be  required  to  provide  some  fixed  number  of  career  entrants  within 
a  CPG) ,  the  ongoing  calculations  become  exceedingly  complex. 

Thus  we  recommend  that  personnel  flow  models  not  be  encumbered 
with  these  intricate  calculations  during  their  basic  operations;  i.e., 
they  should  continue  to  be  constructed  as  deterministic.  However, 
uncertainty  can  be  significant  enough  that  we  recommend  that  such 
models  have  appended  stochastic  "post-processors"  which  evaluate  asso¬ 
ciated  means,  variances,  and  approximate  event  probabilities  using  the 
methods  we  have  developed  here.  This  will  provide  ready  assessments  of 
the  extent  of  uncertainty  in  model  projections.  Further,  if  risk  aver¬ 
sion  is  important,  such  post-processors  can  be  employed  to  identify  con¬ 
straints  which  should  be  incorporated  in  the  deterministic  flow  models. 
This  could  be  accomplished  either  interactively,  involving  personnel 
policy  analysts  in  changing  inputs  and  exercising  the  flow  models  itera¬ 
tively,  or  directly,  by  specifying  in  advance  desired  probabilities  for 
certain  events  and  imbedding  the  flow  models  in  "logical  loops"  which 
would  alter  and  rerun  them  until  acceptable  results  are  achieved. 


Regarding  the  stochastic  inputs  necessary  for  personnel  flow  models 
(e.g.,  upgrade,  reenlist'  t,  and  loss  rates),  we  believe  improved 
estimation  procedures  should  he  developed.  These  methods  should  provide 
consistent,  interpretable,  and  parsimonious  sets  of  parameters  for  esti¬ 
mating  flow  rates,  they  should  incorporate  time  series  data  (in  order  to 
detect  and  project  underlying  trends),  they  should  include  "environmental' 
data  such  as  occupational  categories  and  corresponding  civilian  economic 
conditions,  and  they  should  admit  to  statistical  goodness-of-f i t  pro¬ 
cedures  so  that  their  accuracy  can  be  assessed  systematically.  Uncer¬ 
tainty  in  these  rate  estimates  is  particularly  important  since,  as  we 
have  seen,  it  can  contribute  significantly  to  uncertainty  in  related  work 
force  character  is t ics .  We  recommend  that  the  first  step  in  investigating 
these  I  low  rates  be  to  employ  log-linear  models  (with  logit  models  as  a 
special  case)  to  identify  categories  of  enlisted  personnel  whose  flow 
behaviors  differ — e.g.,  subdividing  the  work  force  according  to  occupa¬ 
tional  subsets,  educational  backgrounds,  mental  aptitudes,  marital 
status,  etc.,  as  appropriate.  The  second  step  should  be  to  develop  a 
sequential  decisionmaking  model  which  will  predict  how  the  flow 
behaviors  for  the  various  categories  of  personnel  will  change  if  manage¬ 
ment  "control"  policies  such  as  compensation,  promotion  opportunity,  edu¬ 
cational  benefits,  or  retirement  programs  are  changed.  These  estimates 
of  revised  behavior  under  altered  policies  are  obviously  crucial  if  flow 
models  are  to  be  useful  in  evaluating  and/or  selecting  improved  personnel 
management  policies.  This  step  also  should  result  in  estimates  of  flow 
rates  and  their  inherent  uncertainty,  again  because  of  the  potential  im¬ 
portance  of  this  uncertainty  when  the  flow  rates  are  employed  in  des¬ 
criptive  flow  models. 

We  are  convinced  that  substantially  improved  personnel  flow  models 
can  be  developed  ror  Air  Force  use  in  developing  and  evaluating  alter¬ 
native  personnel  management  policies--especiallv  if  smaller  segments  of 
the  work  force  are  to  he  examined  (1)  simultaneously  with  the  total  work 
force  and  (2)  dynamically.  Ideally,  these  models  wind  1  he  constructed  to 
achieve  directed  results,  for  example,  hv  using  opt  inti  /.at  ion  techniques 
to  pursue  user-determined  objectives.  We  have  concluded  that  these 


models  should  be  developed  as  deterministic  flow  models;  but  uncertainty  i 


- 'in¬ 


sufficiently  important  that  such  models  should  include  stoctiastic  post¬ 
processors  to  evaluate  the  decree  of  uncertainty  implicit  in  identified 
results.  In  addition,  careful  attention  must  be  given  to  estimation  of 
important  flow  rates  which  ordinarily  are  model  inputs--par ticularly  to 
altered  flow  rates  which  may  apply  under  different  management  policies 
and  to  the  categories  of  personnel  for  which  these  behaviors  apply. 


i. 
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Api'L  iulix  A 

DEVELOPMENT  OF  THE  STOOiASTI C  FLOW  MODEL 

A. 1 .  Introduction 

This  appendix  gives  a  detailed  description  of  the  stochastic 
(dynamic)  flow  model.  As  described  in  Section  3.1,  the  inputs  to 
the  model  are: 

o  N  (0) ,  the  initial  number  of  people  in  YOS  i  and 
category  m 


o  Fixed  force  size,  N 

o  The  retention  rates  Pim=  probability  that  a  person 

in  class  (i,m)  (YOS  i,  category  m)  will  flow  into 

class  (i  +  1,  m)  in  the  next  time  period.  Thus, 

1-p.  are  the  attrition  rates.  (The  retention 
*im 

rates  may  depend  on  time  t;  i.e.,  the  model 

considers  the  values  p.  (t)). 

r  lm 

o  Accession  mix  . 

o  Costs,  C.  (t)  . 

im 

o  Planning  horizon,  T. 

Since  each  individual  in  class  (i,m)  stays  in  the 
service  with  probability  p^  ,  it  is  clear  that 
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N.  ,  (t  +  1)  =  number  of  individuals  in  Y03  i  +  1, 
i  +1 ,  m 


class  m  in  year  t  +  1 


N. 

lm 

=  I 
X=1 


(t) 


X 


X 


where  X^,  X^,  ...  are  independent  and  identically  distributed 
Bernoulli  random  variables  with  probability  of  success  p  ;  i.e., 

P  <xx  ■  1)  ■  1  -  e  <*,  ■  °)  ■  Pi„. 

The  flow  of  people  through  the  work  force  can  be  illustrated 


as  follows: 

Y0S=1 

2 

3 

4 

t  -  1 

N.  (t-l) 

jin 

N.  (t-1) 
4m 

Time  t 

%(t^> 

N2„(t> 

(t) 

4m 

t  +  1 

N2m(t+1) 

Sv--a 

N,  (t+1) 

3m 

N.  (t  +  1) 
sin 

In  our  model  it  is  convenient  to  assume  the  process  starts 
•  it  time  t  -  0  and  runs  until  t  =  I'  (the  planning  lie  r  i /on  X  .  'I  he 
values  of  N  (0)  are  given  (fixed),  and  the  process  is  allowed 


to  evolve. 
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Fundamental  questions  to  which  we  require  answers  are: 

1.  How  do  we  compute  E  (N^ffi(t)  j  N  (t-1)), 

the  conditional  expectation  of  N.  (t) 

im 

given  the  values 

N  (t-1 )  =  N  ( t-1 1 :  1  •  J  '  A,  1  - '  '  M  ? 

IV  —  —  — 

^  J  ' 


2.  How  do  we  compute  Cov  [N^m(t),  N^(t)  ]  N(t-l)], 
the  conditional  covariance? 

j.  How  do  we  then  compute  EN’im(t)  and  Cov  [Nim(t),  N  (t)], 
the  unconditional  means  and  covariances? 

4.  Are  there  closed-form  analytic  solutions  to  the  above 
questions,  or  must  we  perform  simulation  runs  (Monte 
Carlo  runs)  to  obtain  the  answers? 

5.  What  happens  when  T  -*•  K  ?  Is  there  a  long-run, 

steady-state  (equilibrium)  distribution  for 

{N.  (t)}  ?  If  so,  can  it  be  characterized? 
lm 

In  answer  to  the  fourth  question,  if  the  p  's  and  ti^'s  are 
fixed  parameters  (not  random),  then  we  can  obtain  simple  analytic 

expressions  for  the  above-mentioned  quantities.  If,  however, 

either  the  p.  ' s  or  the  ti  's  are  treated  as  random,  then  we  must 

perform  simulation  runs  to  obtain  the  answers — repeatedly  sampling 

from  the  distribution  of  the  p,  's  and  the  it  's. 

lm  m 

Since  the  n^'s  may  be  fixed  (hence  the  proportional  or 
multinomial  model)  or  random,  the  p  's  may  be  fixed  or  random, 
and  the  (t)'s  may  be  fixed  or  random,  there  are  essentially 
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3  x  2  x  2  =  12  different  cases  that  can  be  treated.  We  treat 
here  only  the  3  most  interesting  cases: 

1.  p  and  t  fixed,  Ti  proportional,  C  random 

2.  p  and  •'  fixed,  ~  multinomial,  C  random 

3.  p  and  ”  random,  -  multinomial,  C  random 

The  first  three  of  the  above  questions  are  answered  in  Section  A. 2. 
Section  A. 3  discusses  the  modeling  of  parameter  uncertainty,  and 
Sections  A. 4  and  A. 5  discuss  incorporation  of  parameter  uncertainty  in 
the  flow  model  (as  well  as  the  answer  to  the  fourth  question).  Section 
A. 6  derives  approximations  to  the  mean  and  variance  of  reenlistment 
rates,  and  the  last  section  answers  the  f i question  concerning  the 
existence  of  a  long-run,  steady-state  distribution  for  the  flow  model. 

A. 2.  Dynamic  Equations  for  the  Means  and  Covariances  of  System  St. ate 

Contents 

In  this  section,  we  develop  expressions  to  compute  iteratively  the 

means  and  covariances  of  N.  (t+1)  as  functions  of  N..(t);  i.e.,  we 

lm  j  .. 

compute  the  conditional  means  and  covariances 
E  (Nlm(t+1)  j  N  (t ) ) 

Cov  [N.m(t+1),  Njf(t+1)  |  N  (t)] 

where  N  (t)  =  (N.  (t)).  We  also  develop  the  unconditional  means  and 

lm 

covariances  at  time  t  +  1  as  functions  of  the  unconditional  means  and 
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covariances  at  time  t.  Hence,  if  the  process  starts  at  the  initial 

values  N®  =  N.  (0)  at  time  t  =  0,  we  can  trace  its  evolution  as  t 

im  im 

grows . 

Up  until  now,  we  have  assumed  that  the  accession  mix  r  and  the 

transition  probabilities  p  =  (p.  )  do  not  depend  on  t.  However,  to 

„  im 

remind  ourselves  that  a  change  in  policy  or  behavior  at  time  t  can 

affect  both  "  and  p,  wt  show  the  dependence  of  ~  and  p  on  t  by  writinc 

T  ( t)  -  (  '  ( t ) )  and  pit)  =  (p .  ( t) ) . 

m  in 

Throughout  this  section,  neither  n(t)  nor  p ( t )  is  random. 

We  develop  these  results  for  two  importantly  different  cases: 
in  the  first  case  the  work  force  size  is  held  constant  at  N,  in  the 
second  it  is  allowed  to  fall  below  N  (i.e.,  the  possibility  of  recruiting 
shortfalls  is  introduced).  We  will  treat  both  t he  fixed  proportional 
case  and  the  multinomial  case  for  r  (t). 

A .  2 . 1  .  Fixed  Work _ Force  Size 

First,  to  obtain  the  conditional  means  and  covariances,  let  I. 

I.(t)  =  N  -  11  r.  N  (t)  =  number  of  accessions  required  for  the  planning 

)>2  C  1 

year  t. 

(i).  Fixed  proportional  case.  We  have  N^Ct)  =  so 

E(N^(t)  |  L)  =  1Tm(t)L  and  Var  (Nlnj(t)  (  L)  =  0. 

(ii).  Multinomial  case.  (liven  the  vector  (N  ^  (t)  , .  .  .  ,Nj  (t  )) 

has  a  multinomial  distribution,  i.e.,  (t  ) , . . .  ,  N  (t))  - 

11  1 M 

W  (L,  it  (t),...,  tt  (t)).  Hence,  E  (N,  (t)  I  L)  =  tt  (t)  L  as  before,  but 
in  im  m 

Var  (N  (t)  |  L)  =  it  (t)(l-ir  (t))  L 
im  m  m 

and 

Cov  (Nlm(t),  Nu(t)  |  L)  =  -trm(t)  nt(t)  L  . 
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To  keep  the  notation  consistent,  we  assume  that  the  transition 

Nj  1(t)->-Nj+1^(t+l)  is  determined  by  Pj^(t),  and  the  distribution  of 
(t+l)»- . . ,  Njj^(t+l))is  determined  by  n  (t).  Jn  our  derivation, 
we  make  extensive  use  of  the  fact  that,  given  N(t), 


Ni»<c> 

Vi,«<t+1>  -  V 


where  X^  are  independent,  identically  distributed  (i.i.d.),  having  a 

binomial  distribution  witii  parameters  1  and  p^(t). 

a.  The  Conditional  Means.  For  t>0  and  l<i<3,  i.e.,  for  the  cate¬ 
gories  of  continuing  personnel,  we  have 


Nim(t> 


E(N 


.  (t+1)  N(t))  =  E  (  E 

i+!,m  -  X-1 


Xx  |  N  (t ) ) 


=  Pim(t)  Nim(t)- 


The  conditional  means  for  the  accession  categories  are  the  same 
whether  we  consider  the  fixed  proportion  or  the  multinomial  case. 

We  have 


E(Nlm(t+1)  I  ?(t))  =  11m(t)  E  (L<t+1>  I  N(t)) 


=  *  (t)  [N  -  S  E  E(N  (t+1)  |  N(t) )  ] 
i>2  l  ->*• 


b.  The  Conditional  Covariances.  For  the  continuing  categories  of 
personnel,  again  for  1  <  i  <  3  and  t  >  0,  we  have 


1 
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N.  .  . 

im(t) 

Var  (Ni+l  m(t+1)  '  N(t))  =  var  (  £  X  |  N(t)) 

’  '  A=1 

=  pim(t)  tl-pim(t)]  Nim(t) 

The  variances  of  the  accession  quantities  N,  (t)  depend  on  whether 

lm 

we  treat  the  fixed  proportion  or  the  multinomial  case.  For  brevity  we 
treat  the  two  cases  simultaneously.  The  difference  is  that  an  extra  term 
enters  in  the  multinomial  case;  we  handle  this  by  introducing  the  indicator 
variable  I  as  follows: 

I  +1  if  in  multinomial  case 

1  = 

I  0  if  in  fixed  proportion  case 


We  then  have  (for  the  first  equation,  see  DeGroot  [8]  )> 

Var  (Nlm(t+1)  |  N(t) )  =  Var  (E(Nlm(t+l)  |  L(t+1),  N(t))  |  N(t)) 

+  E(Var(Nln)(t+l)  |  L(t+1)  ,  N(t))|  N(t)) 

=  Var  (t )  L(t-Fl)  |  N(t)) 

+  I  E(n  (t)fl-7i  (t))L(t+l)  1  N(t)) 
mm  _ 

=  tt  2(t)  Var  (N-  l  l  N.At+1)  |  N(t)) 
m  j>2  2 

+  1  tt  (t)  [  1-tt  (t)  ]  (N  -  l  Ip.  ,  At)  N,  „(t)} 
m  m  j>2  2  ^  J-1 

=  "  2(t)  Z  T.  Var(N..(t+l)  |  N(t)) 
j>2  2 


because,  as  we  show  below,  if  (j,£)  i  (k,n)  and  j  -2,  k -2  then 
Cov(Nj ^(t+1) ,  N^n(t+1)  |  N(t))  =  0.  The  equation  continues  as 

2 


Var  (N,  (t+1) 
lm 


N(t)>  =  7T  (t)  Z  Z  p.  f(t)  [1-p.  ,  „(t) 

j>2  e  J"1>£ 


J-i.« 


(t) 


+ 


71  (t)  [l-’t  (t))  {N-  Z  Ip  .(t)  N.  .  (t) } 

m  m  . .  „  „  j-1 , t  j-1, t 


j>_2  l 


To  obtain  the  covariances,  we  first  consider  the  cases  where  (i,m)  ^  (j,j) 


^  2,  J  i  2.  Then  if  {X.}  are  i.i.d.  b(l,p.  (t))  and  (X  '}  are  i.i.d 

a  lm  \i 

b(l»  P j  ^  C t ) ) ,  and  if  {X^  J  and  fX^  *  J  are  independent  systems,  we  get 
Cov(Nim(t+1)’  Njl(t+1)  I  N(t)) 

=  Cov(  I  X  ,  Z  X’  |  N ( t ) ) 

A-l  p=l  u  ' 

•=  U,  since  X.,  X’  are  independent. 

A  VJ 

Now,  in  case  i  -  1  and  J  ^2,  Ve  have* 

Cov(Nlm(t+l),  N  (t+1)  |  N(t) ) 

=  E  [Cov(Nlm(t+l),  N.^t+l)  |  N.e(t+1),  N(t) )  |  N(t)  ] 


That  is,  the  random  variables  are  independent  and  identically 
distributed  (i.i.d.),  having  a  binomial  distribution  with  parameters 
n  ■  1  and  probability  of  success 


Pjt(t). 


See  De  Groot  [8]  for  the  relevant  conditional  result  which 
says  that  for  random  variables  X,  Y  and  Z, 


Cov(X.Y)  *  E [Cov(X,Y | Z) ]  +  Cov[E(x|z),  E(y|z)]. 
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+  Cov  [E(Nlm(t+l)  |  N  ^(t+1),  N(t)),  E(N  (t+1)  | 

N  (t+1),  N (t ) )  |  N(t)  ] 

=  0  +  Cov  [it  (t)  E(L(t+l)  S  N.  (t+1)  ,  N  ( t )  )  ,  N  (t+1)  |  N(t)  ] 

m  j  Jc  -  J 

=  tt  (t)  Cov  [N  -  IE  p  (t)  N  (t)  -  N  (t+1),  N  (t+1)  I  N  (t) ) 
m  k>_2,n  k-l,n  k-l,n  •* 

Ot,n)  J  (j,£) 

=  -  tt  (t)  Var(N  (t+1)  |  N (t ) ) 
m  j  x.  , 


'  -  \M  IT-Pj-l.tTO) 


In  case  i  =  j  =  1  and  m  j  l,  we  have 


Cov(Nlm(t+l),  Nu(t+1)  [  N  (t)) 


E  [Cov(Nlm(t+l),  Nia (t+1)  |  L(t+1) ,  N(t) )  |  N(t) ] 


+  Cov[E(N]m(t+l)  |  L ( t+1 ) ,  N  (t) ) ,  E(N12(t+l) |L(t+l),  N(t))|N(t)] 


I  •  E ( — tt  (t)  tt  ( t )  L(t+1)  |  N(t) ) 

m  Jc  .. 


+  CoY^Ct)  L(t+1),  7r^(t)  L(t+1)  |  N(t ) ) 


=  -  I  •  irm(t)  if £(t)  E (L ( t+1)  I  N (t )  ) 


5 


j 

j 


+  TT  (t)  TT  (t)  Var  (L(t+1)  I  N(t) ) 
m  i  _ 
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=  -  I  •  TT  (t)  W  (t)  {  l  E  P  (O  N  (t)  ) 

m  k>2  n  k-l,n  k-1  ,n 


+  TT  (t)  IT  (t)  {  E  Ip  (t)  [1-p  (t)  I  N  (t)i 

m  1  k>2«  k-l,n  k-1  ,n  k-l,n 


In  summary,  if  we  let  a(t)  =  N  -  E  E  p.  ..  (t)  N,  .  (t) 

k>2  n  k'1,n  k"1,n 


=  El L( t+1 ) | N ( t ) ! 

and 


b  (t) 


E  E  p 
k>2  n 


k-1  ,n 


(t) 


Pk-l,n(t)  1  Nk-l,n 


(t) 


=  Var lL(t+l) | N( t) ] , 


then  for  i  >  2,  j  >  2,  in  M  , 


i  ?(t)>  ■  n-i,»(t)  N£-i,»(t) 

E(N,  (t-tl)  I  N( c) )  =  ir  (t)  a(t) 

lm  _  m 

Var(Nlm(t+1)  |  N(t»  =  Pi.1>m(t)  H  - 

Var  (N .  (t+1)  1  N  (t) )  =  I  •  it  (t)  [1  -  tt  (t)]  a(t)  +  tt  ^  ( t )  b(t) 
lm  ^  ra  m  m 


Cov(Nlm(t+l),  Nj  ^ (t+1) 

1  N(t)>  =  0 

Cov(Nlm(t+l),  N  (t+1) 

|  N (t ) )  =  -  tt  (t)  p  .  (t)  [  1- 

~  m  j  —  I ,  ic 

"Pj-1 , C  (t  > 1  Nj-l,£(t) 

Cov(Nlm(t+l),  Nlt(t+1) 

(  N(t)  )  =  Tim(t)  TiJ,(t)  {-I  a  ( t ) 

+  b(t) } 

.1 
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Thus  the  only  differences  between  the  fixed  proportion  case  and  the 
multinomial  case  are  the  variances  and  covariances  involving  the  first 
year  of  service. 

c.  The  Unconditional  Means  and  Covariances.  Next,  we  derive  the 
unconditional  means  and  covariances  of  N(t  +  1)  as  functions  of  the 
means  and  covariances  of  N(t)  .  From  the  preceding  equations,  we  have 


EOVt+n). 

E<V<‘+i»-  V‘>  E  Pk-!,n(t> 

Var  N  (t+1)  -  E  [Var (N  (t+1)  I  N  (E))J  +  Var  IECN.  (t+1)  I  N(t))J 
lm  im  1  _  lm  ' 


=  P.  ,  _(t)  [1-p.  .  (t) ]  E  N  .  (t) 

i-l, m  i-1, m  1-1, m 


+  P,  ,  _  (t)  Var  N  .  (t) 

1-1, m  l-J  ,m 


Var  Nlm(t+1)  =  E  [ Var (N^ (t+1)  |  N(t))}  +  Var  [E(Nlm(t+l)  |  N(t))] 


=  I  •  n  (t)  [1-tt  (t)]  {N  -  l  Ep,  ,  (t)  E  N,  .  (t)} 

m  m  ,  „  rk-l,n  k-l,n 

k>2  n 


+  (t )  {  Z  l  p  (t)  [  1-p.  ,  ( t  >  ]  E  N  (t)} 

di  to  K  1 1  n  K  l  •  n  k  i » n 

k>2  n 


+  TT^  (t )  {  T.  I  p  (t)  p  (t)  Cov(N (t),  N  £(t)) 
m  jc>2  n  k.i,n  1  1 ,  jc  k.l,n 


To  compute  the  unconditional  covariances,  we  again  use  the  fact  that 
Gov (X,Y)  =  E[Cov(X,Y|z) ]  +  Cov[E(x|z),  E(Y|z)]  to  obtain: 


'For  a  random  variable  X,  we  use  the  notation  EX  or  E(X)  for  the  mean 
and  Var  X  or  Var(X)  for  the  variance. 
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Cov  (N.  (t+1),  N . „ (t+1) ) 
lm  JH 


p  .  .  ( t )  i).  , 

1  ]-l  , 


(t)  Cov(N. 


i  - 1  ,  m 


(t),  N 


i-1 . 


(t)) 


Cov  (Nlm(t+1),  N  (t+1))  =  -Wt)  Pj_lft(t)  , ( t )  ]  E  Nj_! s .  <t) 


-TT  (t)  p.  (t)  ...  .  p.  .  (t)  Cov  ( N  (t),N.  . 

m  i-l,i  ,  ..  k  -  i  ,  n  k- 1  ,  n  j  - 1  ,  - 

k  •  2  n 


Cov  (N  (t+1),  Nu(t+1)) 


U  (t)  ”.(t)  1-1  IN  -  :  p.  ,  (t)  i:  N  (t)  1 

m  «.  ,  k-  1  ,  n  k- 1  ,  n 

k -2  n 


k>2  n 


p,  .  (t)  [ 1-p  (t  )  ]  K  N,  ( t ) 

k-  1  ,  n  k-  1  ,  n  k- J  ,  n 


+  it  (t)it  (t){  2  I  p,  ,  (t)p.  .  (t)Cov(N,  .  (t) 

"  1  k>2  „,l  t_l'n  l'U'  k  "" 


A.  2. 2.  Random  Work  Force  Size* 

Until  now,  we  have  assumed  that  we  may  recruit  as  many  people  as 
necessary  to  keep  the  total  size  constant.  We  now  assume  that  there 
may  be  shortfalls,  so  that  the  recruiting  quota  is  not  alwavs  met. 

Specifically,  in  our  earlier  notation,  if  wo  need  to  recruit  a 
total  of 


i .  -  l  ( t  +  i)  =  n-  ::  >;  n.  (t  +  i) 
i .  .2  e  I1 

recruits,  to  keep  the  force  size  at  N,  we  assume  tliat  the  actual  number 
* 

I.  of  people  recruited  is  a  random  variable  whose  distribution  depends 


U)  ) 


-(>  7- 


on  I,.  To  introduce  this  type  of  uncertainty,  we  shall  assume  that  both 
the  mean  and  variance  are  linear  functions  of  I.;  i.e., 

■  *  i 

■  *  *  L  =  K  (.1.  I  h)  =  conditional  mean 

*  i 

•  !.  =  Var(l.  |  I.)  =  conditional  variance. 

For  example,  it  the  distribution  of  h,  given  h  is  binomial  with 

parameters  h  and  q  ,  written  (1,  J  I.)  '  h(L,  q)  ,  then  a  =  q  anil 

* 

F  ~  (j(l  -  q)  .  1  1  i  =  1  and  h  =  0  ,  then  1,  s  L,  and  we  recruit 

as  many  as  are  needed. 

We  shall  assume  that  p  and  n  are  nonrandom,  .and  that  we  have 

•••  >v 

the  multinomial  case  tor  accessions.  Note  that,  given  L  ,  the  dis¬ 
tribution  of  Njj(t  +  1),...,  N1M(t  +  1)  is  ^(L  ,  'M), 

‘k 

that  is,  multinomial  w  i  t  la  parameters  1.  and  .  As  before, 

1  M 

we  firsL  obtain  the  conditional  means  and  covariances,  and  then  the 
unconditional  means  and  covariances. 

a.  The  Conti  i  t  iona  I  Moans.  For  I  i  •  1  and  t  ■  0  ,  we  have, 

for  the  categories  of  continuing  personnel, 

K(N  (t  +  l)  j  N ( t ) )  =  p.  (t)  N.  (t)  . 

t+  i ,  m  1  .  i  m  in 

For  the  accession  categories,  we  have 

F.(N,  (t  +  1)  |  N  (t ) )  =  KUHN,  (t  +  1)  i  1.,  MID)  N  ( t  )  1 

1  m  1  m 

=  K  ( E  ( F  ( N  (t  +  I)  I  1*,  I.,  N  ( I  ) )  I.,  Sltll  Nltl) 

1  m 

fhe  lixed  proportional  case  lor  accessions  can  also  he  easily 
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E(E(nm(t)  1.  |  L,  N ( t )  )  |  N  ( t ) ) 


E(n  (t)  E(L  |  L)  j  N ( t ) ) 
m  1  _ 


ii  (t)  E(a  L  N  (  t )  ) 

in 


i  t  a  !N  -  .  .■  p  ,  ( t )  N  ( t ) 

m  .  ,  .  j-1 ,  ■  ]-l,  ■ 


b.  The  Conditional  Covariances.  For  the  continuing  categories  of 


personnel,  again  for  1  <  i  <_  1  and  t  0  ,  we  have 


Var (N  (t  +  1)  j  N  ( t ) )  =  p.  (t)  (1  -  p.  it)]  N.  (ti 

r+l,m  _  im  ini  i  iv. 


For  the  variances  of  the  accession  quantities,  we  obtain 


Var  (N  (t  +  1)  [  N(t))  =  Var  [E(N.  (t  +  1 )  '  L,  Nit  M  Nit 
im  .  im 


+  F.[Var(N]mU  +  1)  :  h,  N(t)  )  Si! 


=  Var [ E (E (N.  ( t  +  1 )  i  L  ,  L,  N(t))  ]  L,  Nit  . 1  N't 
lm 


+  E{ E (Var (N,  (t  +  1)  j  L  ,  L,  N(t))  '  L,  Nit)' 
lm 


+  Var  (E(N,  (t  +  i)  I  L  ,  L,  Nit))  !.,  N  ( t  i  )  N  ■ 
lm 


Var  [  E  ( n  (t)  1.  |  L,  N(t))  |  N(t)  ] 

m 


+  E  IE (n  (t)  [ 1  -  r  (t) ]  L  |  L,  Nil)) 
m  m  .. 


+  Var  (r-  (t)  L  L,  N(t ) )  |  N(t )  I 
m 


r 
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In  case  i  =  j  =  )  ami  n  /  •  ,  we  i;ei 

i:ov(N)m(t  +  1),  N  (t  +  1)  N  ( t ) ) 

=  Cov  [K(N]m(t  +  1)  i  I.,  N  ( r ) )  ,  I-:  (N  J  (l  +  1)  L,  N(u;  ! 

+  e  «cov(N]m(t  +  i),  n  (t  +  i)  i.,  N(tn  :;ui 

=  Cov  [n  (t)  <  L,  :•  (t)  x  L  N  ( L  )  j 

+  E  { E  (Cov  (N  (t  +  1),  N,  (t  +  I)  I.  ,  1. ,  N'uD  :  !,  'On 

lm  i  .  . 

+  Cov(F.(N  (t  +  1)  :  I*,  I.,  N  ( t ) )  ,  F.(Nj  <t  +  1)  i.X,  i.,  N  ( t  ) 

;  L,  N ( t  >  )  '  N(t)  } 

=  irm(t)  tt a  < c)  a2  Var(L  j  N (t)) 

+  E  {E(-itn(t)  tt£  (t)  L*  |  L,  N  (t) ) 

+  Cov  (tt  (t)  L  ,  TT  ( t )  L  I  L,  N(t)  I  N  ( t ) )  } 
m  x,  _ 

=  tt  (t)  it  (t)  a2  Var  (L  |  N(t)) 
m  x  _ 

+  E  {-  it  (t)  tt  (t)  a  L  +  u  (t)  it  (t)  6  L  |  N(t)} 

m  x  m  x 

’  "„(t>  \(t>  “2  ^  \  Pj-i,t(t>  11  -  pJ-i,e(t>1  Nl-i.i(t) 

+  TT  (t)  Tt  (t)  (B  -  a)  {N  -  I  r  P.  ,  0(t)  N,  .  c  (t )  >  . 

m  £  .  J-^e  j-l.f 
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In  summary,  if  a(t)  and  b(t)  represent  the  (conditional  on  N(t))  mean 
and  variance  of  the  number  of  recruits  required  in  year  t+1  to  achieve 
the  target  end-strength  of  N — i.e.,  if 


a(t)  =  N  -  I  1  p  (t)  N  (t) 
k>2  n  k  i,n 


Mt)  =  k>2  n  Pk-l.n(t)  [1  '  Pk-l,n(t)]  \-l,n(C) 


then  for  i  2,  j  ^2,  and  n  4  l  , 


E(N.  (t  +  1)  N  (t ) )  =  p.  (t)  N  (t) 
im  _  l-l  ,m  l-l, m 


E  (N .  (t  +  1)  |  N  (t ) )  =  tt  (t)  a  a  (t ) 
lm  _  n 


Var  (N  (t  +  1)  I  N (t ) )  =  p  1  (t)  [1  -  p  ,  ( t )  ]  N  . 

im  '  _  i-i ,m  i-l,m  1-1, r 


Var (N  (t  +  1)  j  N (t) )  =  +2( t)  a  b(t) 
lm  _  tn 


+  {m  (t)  [1  -  m  (t>j  a  +  r 2  ( t )  6)  a(t) 
m  m  m 


Cov  (N .  (t  +  1),  N.  (t  +  1)  |  N ( t ) )  =  0 
im  J  ‘ 


Cov  (N  (t  +  1),  N ,  .  ( t  +  1)  |  N(t))  =  (t)  ,i  p.  .  (t) 

1  m  j  i  -  it.  J  ~  r ,  ' 


[1  -  p.  .  (t)  1  N.  ,  (t) 

J-l.k  1 ~ 1 » ‘ 


Cov  (N  (t  +  n,  N.  (t  +  1)  \  N(t))  =  r  (t)  -  (t)  O  b  ( t ) 
lm  1 >  -  m 


+  ti  (t)y.  ( t )  (t-  - 

m  ' 


(t) 


i )  a  ( t  )  . 
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I'he  Fncondit  ional  Means.  For  1  •  i  *"  !  and  t  •  0  , 


we  have 


F  N  (t 

l  +  l  ,m 


+  1) 


P .  (t) 

lm 


EM.  (t) 
im 


For  the  case  i  =  1  and  t  >  0  , 


K  N,  (t  +  1)  =  -  (t)  a  fN  -  Z  :  p.  ,  ,  (t)  F.  N.  .  ...  (t)  -  . 

lm  m  . ^ 2  ,  J ~  1 » 1  J  ~ 1 » ■ 

d.  The  I’ncondit  ional  Covariances.  For  the  continuing',  cate 
of  personnel,  again  for  1  <  i  <  ‘J  and  t  >  0  ,  we  have 


Var  (N .  (t  +  1))  =  E(Var(N.  (t  +  1)  !  N'(t))) 
im  im 

+  Var  (F.  (N  .  (t  +  1)  !  N  ( t ) ) ) 
im 


=  p .  ,  ( t )  (  1  -  p .  ( t )  ]  E  N  ( t ) 

1 i-I, m  l-l, m  l-l ,m 


+  p'.’  ,  (t)  Var  N.  (t) 

i-I ,m  l -1 ,m 


For  the  accession  quantities, 


Var  N .  (t  +  1)  =  F.  (Var  (N.  (t  +  1)  |  N(t))) 
lm  lm  .. 

+  Var (E(N.  (t  +  1 )  1  N(t ) )) 

1  m 

=  ii''(t)  rt*  Eb(t)  +  {i  (t)  [  1  -  ( t  )  ]  l+’ 

m  rn 

+  <T,)  (t)  -x2  Var  a(t)  . 
m 


;or ies 


•  U)  :•  •  Fait) 

I?7 
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To  obtain  covariances,  first  assume  ( i ,  m)  f  (j  ,  .)  and  i  y  2  ,  j  -  2  . 
Then 


Co v  (N .  (t  +  1),  N.  (t  +  1))  =  K (Cov (N .  (t  +  1),  N .  (t  +  1)  !  N(t))) 
ini  j  ..  im  j  >. 


+  Cov (E (N .  (t  +  1)  |  N ( t ) ) ,  E(N .  ,(t  +  1)  i  N ( t) )) 

im  _  j  x  _ 


0  +  C°v(pi-l,m(t)  N'i-1  ,m(t)  '  Pj-l,v  (C)  Nj-l,i(t)) 


p.  ,  (t)  p.  (t)  Cov(N.  (t),  N.  ,(t)). 

l-l, m  j-l,fc  l-l  ,m  j  —  1 ,  £ 


If  i  =  1  and  j  >_  2  ,  we  have 

Cov(Nlm(t  +  1),  N.£(t  +  D)  =  E(Cov(Nlm(t  +  1),  N  (t  +  1)  |  N(t))) 

+  Cov  (E  (N,  (t  +  1)  I  N  ( t ) )  ,  E(N  .  (t  +  1)  I  N  ( t ) ) ) 

im  _  j  x  _ 

=  --n  (t)  a  p  (t)  [  1  -  p  ( t )  ]  E  N  ,  (t) 

m  J  -L  *  v  j  1 ,  >  j  1  ,  » 

+  Cov(i  (t)  a  a  ( t )  ,  p.  ,  (t)  N.  (t)) 
m  i  —  1 ,  *  j  —  1 , 

=  -,Tm(t)  °  pj-l,e(t)  [1  -  pi-l,i(t)]  ENj-l,i(t) 

+  a  TT  (t)  p  „(t)  Cov  (a  ( t )  ,  N.  ,(t))  . 
m  J-l,»  J-l,'- 

Finally,  for  i  =  j  =  1  and  m  /  C.  , 

Cov  (N .  (t  +  1),  N.„(t  +  1)  |  N  ( t ) )  =  F.  (Cov  (N  (t  +  1),  N.  (t  +  1)  |  N(t))) 

J.m  Ik  _  lm  iv 

+  Cov  (E  (N  (t  +  l)  |  N  ( t ) )  ,  E  (N  (t  +  1)  |  N  ( t ) ) ) 

Im  _  1 1 

=  tt  (t)  r  ( t)  (a?  Eb  ( t )  +  (p.  -  a)  E  a(t)] 
m  * 

+  Cov  (  tt  (t)  a.  a  ( t )  ,  TT  ( t )  a  a(t)) 
m  v 
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=  r  (t)  n  (t)  [a-  E  b(t)  +  (f  -  a)  K  a(t)| 
m  < 

+  7!  (t)  v  .(t)  r'  Vac  a(t)  . 
m 


A. 3.  Modeling  of  Parameter  Uncertainty 

Recall  that  the  retention  rates  (p.  (t))  and  the  accession  mix 

lm 

fractions  (tt  (t))  are  inputs.  To  consider  uncertainty  in  these  param¬ 
eters  in  our  analysis,  we  have  ignored  the  possibility  of  their  time 

dependence  and  supposed  they  have  been  estimated  from  actual  data — as 

.1. 

would  be  the  case  if  the  AID  techniques  were  employed.  Thus  the  esti¬ 
mate  p.  of  p.  is  treated  as  a  normal  random  variable  witn  mean  p. 

lm  lm  lm 

and  variance  p.  (1-p,  )/n.  ,  where  n.  is  the  number  of  individuals 
lm  im  ltn  im 

in  YOS  i  and  category  m  whose  stay/leave  behavior  is  observed  in  the 
sample.  Although  0  p  1 ,  we  are  assuming  that  n^  is  sufficiently 
large  so  that  the  normal  distribution  approximates  the  binomial  distri¬ 
bution  adequately. 

To  model  uncertainty  in  the  tt  's,  we  will  require  that  Z  it  =1. 

m  m 

a  m 

Since  each  "estimate"  r  is  also  required  to  lie  in  the  interval  [0,1], 

m 

a  useful  probability  distribution  to  model  the  joint  distribution  of 
(it,,..., it  )  is  the  Dirichlet  distribution.  (See  Do  (Iroot  [8].) 

If  a, . a  (all  nonnegative)  denote  the  parameters  of  this  distri- 

bution,  its  density  is  defined  by 


Automatic  Interaction  Detector,  the  primary  statistical  approach  used 
by  the  Air  Force  for  discerning  behavioral  categories  and  estimating 
corresponding  flow  rates. 
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where  =  I  and  x^,...,x^  are  all  nonnegative,  and 
m 

Moreover,  if  (X^,...,X^)  have  a  Dirichlet  distribution 
be  shown  that 


I  x  =  1 . 

m 

m 

,  then  it  can 


EX  =  a  /a 
m  mo 


a  (a  -a  ) 

Var  X  =  °  °  m- 

m  a  (a  +1) 
o  o 


a  a 

Cov  (X  , X  )  =  -  —  ,  m  4  l 

IB  X.  t  \ 

a  (a  +1) 
o  o 


For  convenience  in  our  computer  program,  we  chose  (n.  )  such  that 

lm 

n.  !Z  n,_  —  tt  ,  for  each  m,  and  let  a  =  n,  .  (This  gives  estimates 
lm  1£  m  m  lm 

for  the  Dirichlet  distribution  parameters  consistent  with  those  ob¬ 
tained  from  using  the  most  recent  year's  observed  accession  mix  as 
data.)  Consequently,  (it  )  has  a  Dirichlet  distribution  with 


E  tt  =  it  =  n,  /E  n. 
m  m  lm  ^  lk 


Var  (ti  ) 
m 


nlm (k  n 1 k~nlm) 

a  n ,.)?a  n.  +1) 
k  lk  k  lk 


Tl  (1-TT  ) 

m  m 
l  n  +1 
k  lk 


71  m  (  1  —  71  ^ 

m  m 


lk 


„  in  U 

Cov  (TT  71  )  =  -  - 

1  '■  (r  n  )2(L  n  +2) 

k.  k  1 K 


I  n,,+l 


it  77  „ 

mi  ,  . 

^  _  f~Z -  .  m  ^  Jl 

—  In,. 


Thus  the  first  and  second  moments  of  (n, , . .  .  ,ir  )  are  approxi- 

1  M 

mately  those  of  estimators  of  the  parameters  of  a  multinomial 

distribution,  with  sample  size  £  n.,  . 

k  ik 

The  above  approach  provides  an  approximate  fee.’  for  the  order 

of  magnitude  of  the  variance  of  the  p.  's  and  the  tt  's  if  they  are 

im  m 

estimated  cross-sectionally  using  a  recent  work  force  of  size  N. 


A. 4.  Incorporating  Parameter  Uncertainty  in  the  Flow  Model 


Section  A. 2  described  the  formulas  used  to  obtain  E  N.  (t)  and 

im 

Cov  (N,  (t) ,  N,„(t))  for  fixed  values  of  p.  and  tr  .  Now  we  denote 
lm  j  &  lm  m 

these  quantities  by 


E(Nim(t)|p,  u)  and  Cov  (N±m(t) ,  N.e(t)|p,  n) 


to  indicate  their  dependence  on  p  and  u.  Since  there  is  uncertainty 
associated  with  estimating  the  true  values  of  p  and  n ,  both  the 
means  and  the  covariances  vary  depending  on  the  estimated  values  of 


p  and  it. 
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Tl>e  mean  E  Nim(t)  is  obtained  by  averaging  E(Nim(t)  |p,  tt)  over  the 
distribution  of  p  and  it;  i.e., 

E  Nlm(t)  =  E[E(Nim(t)  jp,  it)] 

and  Cov  (N,  (t),  N.„(t))  is  obtained  from: 
im  j  Sc 

Var(N  (t))  =  ElVar(N1]n(t)  |p,  n )  ]  +  Var  (EfNMt)  [p,  tt)  ] 

Cov[Nim(t)’  Nj£(t)1  =  E^ov  ’  Nji(t)  •?’  !]) 

+  Cov  [E(N .  ( t )  j  p  ,  tt),  E  (N  ( t )  J  p  ,  ti)] 

These  quantities  are  nonlinear  functions  of  p  and  it,  so  it  is  dif¬ 
ficult  to  obtain  (even  approximate)  analytical  expressions  for  these  quan¬ 
tities,  based  on  the  distributions  of  p  and  it.  To  evaluate  these  (and 
other)  quantities  of  interest,  we  employ  Monte  Carlo  simulations  using 
the  distributions  of  p  and  it.  We  generally  performed  400  replications 
in  our  simulations. 

A.  5 .  A  Brief  Look  at  Co st  Uncerta inty 

Our  model  assumes  that  the  cost  C.  (t)  associated  with  each 

im 

individual  in  the  class  (i,  m)  in  year  t  is  also  random.  The  total 

(random)  cost  for  tile  work  force  In  calendar  year  t  is 

C(t)  =  i  C.  (t)  N.  (t). 

.  im  tm 

l  ,m 

Civen  N(t),  the  (conditional)  expected  cost  for  year  t  is 
K  ( 0  ( t  )  i  N  ( t ) )  =  EC.  (t)N.  (t) 

im  Hi) 


i  .  m 
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and  the  (conditional)  variance  is 


Var  (C(  t)  |  N  ( t ) )  =  Z  CoviC.Jt)  N  (t),  C.^t)  N  (t)  |  p,  ■] 
i.j.m.P.  J 


Z  N  .  ( t) N  .  (t)Cov[C(t)  ,  C.  ( t)  ]  . 
,  im  1  ic  lm  I 

Hence  the  unconditional  expected  cost  is 


EC( t)  =  Z  EC.  (t)EN.  (t) 
im  im 

1  ,m 


and  the  unconditional  variance  is 

Var  C(t)  =  E[Var(C(t)  | N ( t) )  ]  +  Var [E(C( t) | N( t) ) ] 


=  I  E(N  (t)N  (t))Cov[C  (t),  C  (t> ] 

.  .  „  im  i  x.  im  ix. 

+  I  EC.  (t)  •  EC .  ( t) Cov [N .  (t),  N  ( t) ] 

.  im  jP  im  jt 

i,J,m,P 

Z  Cov[Cim(t),  C  (t) )  iCov[N  (t),  N  (£)] 
i,j,in,2  J  J 

+  Z  Cov[Nim(t),  N  ^(t)]  EC  (t)  EC  .  0  (t)  . 


EN.  (t)F.N.  (t)' 
J  m  J 


i,j ,m,P 


UTl 


'j* 


This  shows  how  the  variance  of  the  total  cost  depends  on  the  means  and 
covariance  of  N(t)  and  C(t). 


A.6.  Approximation  of  Means  and  Variances  for  Required  Reenlistment  Rates 

Reenlistment  rates  are  computed  for  the  first-term  work  force  as 

a  whole,  and  also  by  category.  Let  0  <  c  1  (where  c  may  depend  on 

the  category  m) ,  IJ  be  force  size,  and  the  random  variable  X ( t )  be 

either  the  nimber  in  t  lie  entire  fourth  year  group  (IN,  ( t )  )  or  the 

Am 

m 

nimber  in  the  fourth  year  group  for  cell  m  (N^(t)).  The  expected 


required  reenlistment  rate  and  the  variance  ol  Lhe  required  reenlistment 
ratt>  are 

E(cN/X(t))  and  Var  (cN/X(t))  . 

We  wish  to  obtain  approximations  to  these  quantities  by  means 
of  the  first  and  second  moments  of  X(t).  Theoretically ,  since  X(t) 
is  a  discrete  random  variable  with  positive  probability  of  being 
equal  to  zero,  the  above  quantities  are  not  finite.  However,  in 
practice,  we  know  that  our  model  is  only  an  approximation  to  reality, 
and  that  the  likelihood  that  X(t)  =  0  is  quite  small.  Therefore  in 
our  calculations  we  assume  that  l’(X(t)  =  0)  =  0  (i.e.,  we  truncate 
the  distribution  of  X(t)  away  from  zero). 

Let  g  be  a  continuously  differentiable  function.  Let  Y  oe 
any  random  variable,  and  let  ,i  =  E  Y  be  its  mean  value.  Consider 
the  Taylor  series  expansion: 

g(Y)  -  g(u)  +  g '  (p)  [  Y-n  ]  +  I Y  - ,  i  ] 2  +  remainder. 

Assuming  the  remainder  term  can  be  ignored,  we  find 

E  g ( Y )  ~  g(u)  +  g'(p)[u-p]  +  ^4^  E(Y-i;)~ 

-  g(li)  +  Var  Y  • 

Moreover,  by  ignoring  the  second  and  higher  order  terms  in  the 


Taylor  series  expansion,  we  have 
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Var  g (Y)  ~  Var  (g(p)  +  g’ (u) (Y-p) ) 

=  Var  ( g 1 (m) (Y-m) ) 

=  g’(u)2  Var  Y  . 

Now  let  g(y)  =  1/y ,  and  let  Y  =  X/cN.  The  higher  order  terms 
of  the  Taylor  expansion  can  be  dropped  If  they  are  0(N  ’),  where 
>  ■>  1.'  We  know  from  empirical  results  and  from  the  limiting  stationary 
distribution  (see  Section  A. 7  below)  that 

Var  Y  =  OfN"1)  , 

because 


E  X  <*  N 
Var  X  a  N 

I 

Since  Var  Y  =  0(1/N),  the  above  approximations  using  the  Taylor 
expansion  (together  with  truncating  X)  is  justified  (Bickel  and 
Doksum  ( 7 | ) . 
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Vnr  X 
(KX)  J 


and 


<YM) 

\ar (cN/X)  ^  ''ar(X/cN> 

(EX)'* 


(cK) 

(EX) 


2 

A' 


Var 


X. 


In  the  limiting  (stationary)  case  (see  next  section),  X(t)  is  binomial 

with  parameters  N  and  where  f,  =  if  X  =  or  !,  =  X  if  X  =  VN 

™  m  4m 

In  this  case,  EX  =  £,N  and  Var  X  =  1(1  -  i)N.  Thus 


F.  (cN/X) 

Var (cN/X) 


+ 


(1  -  U 
CN 


o 


A.  7.  The  Limiting  Stationary  Distribution  of  the  System 

In  this  section  we  develop  tlie  limiting  distribution  of  the  system 

{ N .  (t):  1  ^  i  <  A,  1  <  m  ^  M*  as  t  +  We  assume  that  the  p .  's 

tm  —  —  —  —  ini 

and  the  ti  's  are  fixed,  and  that  the  r  's  are  parameters  for  the 
m  m 

multinomial  distribution. 

During  the  development  of  the  work,  the  limiting  stationary  case 
was  actually  solved  before  the  equations  of  Section  A. 2  were  developed, 
and  it  subsequently  was  used  as  a  check  on  the  results  from  Section  A. 2. 
We  feel  the  results  of  the  limiting  case  are  sufficiently  interesting 
and  useful  to  he  recorded  here. 
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laider  the  above  assumptions,  ilu  tN.  (it) 

1111 

(a  a  x  M  matrix)  of  the  system  at  t  ime  t  moves  to  tne  •■tat. 

(N.  (t+1))  as  follows: 

im 

N.  (t) 

ini 

( A  7 .  1 )  1.  N  ,  (t+1)  =  i  X  wlte  re  X  a>e  i  mlepeml  on  i  , 

i  +  1  ,m  =) 

identically  distributed  binomial  random  v.irinnle- 

with  parameters  1  ami  p.  ,  tor  1  i 

im 

4  M 

2 .  t'.iven  1.  =  N  -  .  N  .  (L  +  l)  ,  tin  access  ion  ..  i  t 

j  =  2  -1  '■ 

IN.  j  (t  +  1  )  ,  - N,N(t  +  l)'»  has  a  -nit  bomio!  i  :  t  .  : 

with  parameters  1,  and  ,  , 

Thus,  the  transition  from  the  state  at  time  t  to  the  state  at  time 

t  +  1  depends  only  on  the  state  at  Lime  t.  Hence  (N .  (tl)  , 
r  mi 

t  =  0,  1,  2,  3,  . . . ,  is  a  finite  Markov  chain  (indeed,  besides 
having  a  finite  state  space,  the  chain  is  aperiodic  and  irredin  ible) 
with  state  space 

'4  =  f  ( n  )  :  each  integer  n.  d,  n.  N 

im  m  .  i  + 

i 

Tine,  f  rom  the  theorv  of  Mtirkov  chains,  \  f  t  )  ha  a  ■  tat  ion.ir 


I  i  tn  i  t  i  ng  d  i  s  t  r  i  bn  t  i  on  O  (  •  )  on  4  ( i  a  i  '  on  1  1  1 1  )  .  1  b  1 1 
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n  -  ( n  >  .  .  , 

l  in  1  •  i  •  * 

)  -  !'!  M 


In’  .  K  . 

1!"  1  *  I 


11  ,11 


p  (t  +  n 


t  here  is  s  ii  i  s  i  r  it'u  t  i  .pi  "'In*  an  ’  sue*.  that 


i.  A  .  .  l  Hint  =  !’  ,  i '  (  1 1 

it,  p, 

li 


will*  1*1*  t  hi'  suv:  i  :  .*\vr  it  !  n  sui'h  tint  l' 


U’.irv.ni  1  1  i]  1  . 


Also, 


1  itn  P  ( N  (  t )  •-  n )  =  Q(n)  . 
L  >  1 


The  problem  then  is  to  determine  <);  i.e.. 


to  the  parameters  p.  and  r  , 
i  m  ui 


First  ,  t  rnin  t  A7.  1)  and  From  Sent  ion  A.  A  ,  w 


(A/.l)  K(h  (t+1) 

2*1  ,  fil 

and 


IN  . 

i  m 


(t)> 


1-  .  N  . 

'  nil  ir.i 


(O 


F.(K  (t  +  1)  :  L) 

lm 


:  L  . 
in 


how 
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Thus 


(a; 


j  +  J  ,  m 


(HI) 


EN.  (t) 

l'n 


EN,  (1+1)  =  r  EL 

lli\  \\\ 

where  expectation  £(•)  is  lak«  n  vv’ith  respect  to  the  stations] y 
dis  tr  ibul  i  on  Q. 


Let 


* 

r. . 


lim  UN .  (t). 

im 


'..V  have 


EN2m  (t+1)  =  Pjri!  KNlmU>’  a°d  ln  Chc  1  imit  hr,  =  PJm  ’in  ; 


EN,  ( t+3 )  =  p.,  KN (0  -  p.,  p,  EN  (t-H,  and  in  tin.-  limit 
3m  ‘  dm  dm  1  dm  Im  In. 


"3in  P2m  Pli:.  'In  ; 


and  similarly 


(t-d),  and  in  the  limit  -  p  ,  p 

;  afi)  - 


Furthermore , 


n  .  .  (  t  ) 


EN,  (t+1)  =-•  '  El.  r  EiN  - 

lm  m  tn  .  ,  „  ) 

J~2  ' 


r  In  -  >.  e  n  .  < 

"  1=2  x  J' 
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and  in  the  limit 


A  M 


L  =  t.  {N  -  l  l  i  p 
lm  m  j_2  1 1  11 


"  Pj-1,£ 


Thus  since  the  o.  *  s  and  *n  1  s  are  known,  we  have  a  system  of 
•  im  m 

: k 

M  linear  equations  in  M  unknowns  (the  #s) : 


<t«  +  *■  /■  pn 

j -2  £=1 


.  p .  ,  0  =  N ,  l<m<M 
m  *  - 


or 


<A7-5)  «I» +  *.  }■  hi  /■  ’’u 

1=1  3-2 


.  p .  .  .  =  it  N,  l<m<M. 
'j-l.i  m  ’  - 


In  the  remainder  of  this  section  we  show  how  to  solve  these 

* 

simultaneous  equations  for  the  expected  values  £  and  then  obtain  the 
steady-state  probability  distribution  Q(n)  ,  a  multinomial  distribution 

k 

with  parameters  N  and  E.  =  E.  /N. 

ira  im 

First,  if 


b*  =  *  Pl£ 

J=2 


then  (A7.5)  can  be  written  as 


M 

t*  +  "  ?■  e*  b  =  TT  N,  1<HKM 

lm  in  .  ,  H  H  m  - 
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Def ine 

b .  « b  ^  ^  <  b«( 

II  j.  £.  I  n 

*2  bl  *2b2  '  *  ■  1,2bM 

bl  *Mb2  -  *  ‘  1,MbM 

C*T  =  ^*1’  and  117  =  ^1  »  •••’  V*  T,u-‘n  wr'  havc 

(I  +  B)  =  N  •  n  . 

* 

Hence,  solving  for  ,  we  obtain 

4*  =  N(I  +  B)"1  tt  . 

...  M 

LEMMA.  Let  s  =  tr(B)  =  l  tt  b 

1=1  E  C 

Then 

(I  +  B)_1  =  I  -  ts  +  1)_1  B  . 

Proof  (outline) 

th  2  2 

It  is  easy  to  show  that  the  ij  element  of  B  ,  (B  )^,  is 
2 

equal  to  sB...  Hence  B  =  sB.  Therefore 
U 

B2  +  B  =  (s  +  1)  B, 

^We  are  indebted  to  Michael  D.  Miller,  a  Rand  Corporation  colleague, 
for  this  lemma. 
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so 


(s  +  1)  B  -  B  -  B2  +  (s  +  1)  I  =  (s  +  1)  1  . 


Thus 


(B  +  I)  <<s  +  1)  I  -  B)  =  (s  +  1)  I, 


or 


(B  +  I)'-1  =  I  -  (s  +  l)"1  B  . 


Consequently,  we  have 


(*  =  N(I  -  (s  +  1)_1  B)  n 


Thus 


'  *  =  N  IT  -  — T  :  ’•  h 

lm  m  s+1  ni  , 


=  N 


m  s+I 


=  N 


s+i 


Hence  for  all  m,  as  ,  we  have  the  following  straightforward 


expressions  for  evaluating  the  steady-state  expectations  i. .  : 


E  N.  ( t)  -*•  N  *  v  /(s  +  1)  =  f. 
lm  m  lm 


and 


E  Nim(t)  '  N  ’  plm 


P  .  .  H  /  (  S  +  1  ) 

l-l, m  m 
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To  obtain  the  entire  probability  distribution  for  the  steady-state 
Markov  chain,  we  define 


u 


lltl 


T! 

m 


and 


a . 

lm 


lm 


l-l, m  m 


2, 


so  that 


lm 


=  a.  p.  ,  for 

l-l, m  ri-l,m 


2. 


Now  let 


?l*'  “1*/  /,« °i‘* 

so  that  C-  =  S.  ,  p.  ,  •  Then  E  a.  =  1  +  s,  and  the  above 

lm  1-1, m  i-l, m  ^  ^  J* 

shows  that  EN,  (t)  =  Nf.  .  Note  that  each  l,.  >  0  and  E  £.  =  1. 

«  m  i  m  — “  1  111 


un 


lm 


i  ,m 


Consequently,  since  E  N^(t)  -  N,  one  would  guess  that  the 

i,m 

stationary  distribution  of  (N.  (t))  is  multinomial  with  parameters 

im 

N  and  ( F  )  ,  the  latter  representing  "cell  probabilities."  This 
'im 

is  in  fact  the  case. 

THEOREM.  Lot  C(n)  denote  the  multinomial  distribution 
n(lt,  (qm))  on  S  .  Then  f.  =  q. 

Proof  (out. line) 

We  need  to  show  that  C(n)  has  the  steady-state  property  that 


f,(n')  -  E  P  ,t;(r)  ,  for  all  n,  n'  e  S  .  Since  the  steady-state 


n,n 


distribution  Q(n)  is  unique,  we  will  thin  have  Q(n)  -  !,( n)  . 


A 
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Now 


£  (n) 


N! 


n  n 
i,m 


im! 


’im 


and  £  (n* ) 


N!  -r-j-  n  £.  n 

'  5  11 


l  ,m 


II  n ;  !  .  nn 

.  im  i,m 


Note  that  in  order  for  P  ,  to  be  positive,  we  must  have 

n  ,n 

n.  >  n' . , ,  ,  l<m<M,  1< i <3 .  Moreover, 

im  -  j+l, ro’ - ’ 


<m-6)  p„,„' . ;  i  h* 
‘  — 1 


p  J+I’m 
jm 


n.  -n'.,, 

(1  -  p  )  Jm  J+l’m 


from  fact  that  attritions  are 
Bernoulli  trials 


(In'  ) !  nV 

-VV3-  n  TT  lni 

Hn  m 

m  1  m  m 


Also  recall  that  £.,,  =  p  . 

1+1, m  \jm 

Giver  n' ,  (A7 .6) ,  we  get 


from  fact  that  accession  mix 
is  multinomial 


E  «">  Pn  ,n'  '  1 
n  n 


N! 


M  U  n, 


n  n  e 


km' 


n  n  n.  !  .  ,  , 

im  m=l  k=l 

Vm  i 


km 


l  _ 

m  j=l  n’  ,,  !(n.  -n' .  )! 

i  J  j+l,m  jm  j+l,ra 


n .  ! 

jsl 


n  1 n ,  "~n  , . « 


<E  "V 


m 


n  n  1 _ 


Ira 


n  it 

in  ra 


lm 


im 


TO 
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=  l 


N! 


II  n  n.  .' 
n  .  im 
.mi 


<£n’lm>! 

m _ 

Iln’.  ! 

lro 

m 


M 

•  n 

m=l 


An.  3  n .  ! 

n  Lkm  n  - *jy — — v — 

k=l  “  j  =  l  n  j+l,m‘  njra  n  j+1 


For  each  m. 


n  . , ,  n .  -n  . . .  n 

1+1, m  .  lm  i+l, m  lm 

•  p.  J  ’  (1-p.  )  1  n 

jm  *jm  in 


n,  n~  n.  n.  -n  „ 

lm  2m  Am  _  Ira  2m  ..... 

Hm  42m  Hm  4m  ’  4m  4m 


n  _  n_  -n  - 
2m  ,  2m  3m 


n\  -n'  n'  n. 

3m  3b  Am  Am  Am 

4m  * '  ‘  4m  *  4ra  4m 


Since  p.  £.  =  .we  have 

jm  Jm  j+l,m 


n1  .  . ,  1+1  ,m  n*  .  , , 

j+l,m  J  ’  3+1, m 

P  •  V  •  •  . 

jm  jm 


’j+1  ,m  ’ 


hence  we  obtain 


(A7.7) 


m,  (In' .  )! 


A  “  H  n.  ! 

n  n  i  i  n  Ain 
U  Tl  n  .  !  _  m 

•  i  3m 
m  j  =  l 


„  1 

11  4,  (n.  -n'  '  TT 

m  3=1  jm  3+1, m 


A  n 


3  n ,  -n 1 


•  (n  n  r  km)  (n  n  c.j,n  j+1’m) 


m  k=2 


’km 


m  j  =  l 


3>n 
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Thus  the  last  sum  in  (A7.7)  becomes 


n  tt 

n 

m 


lm 


.  n 


3 

n 

m  j=l 


ft.  ! 
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But  this  sum  is  the  multinomial  expansion  of 
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Thus  we  get 
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Hence  we  obtain 
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Thus  the  proof  is  complete. 


Appendix  B 


TABLED  COMPUTATIONS  OF  KEY  OUTPUT  QUANTITIES 


✓ 


i 


UNCERTAINTY  TYPES:  Fixed  P'8 


nrr-ovt'T  <j  it 


TABLE  B2 .  ACCESSIONS  (LARGE  CELL) 


UNCERTAINTY  TYPES: 


TABLE  B4 .  TOTAL  FOURTH-YEAR-GROUP  SIZE 


TABLE  B5 .  FOl’RTH-YEAR-CROl'P  SIZE  (LARGE  CELL) 


TABLE  B6 .  FOURTH-YEAR-GROUP  SIZE  (SMALL  CELL) 


TABLE  B7 .  TOTAL  REQUIRED  REENLISTMENT  RATE 


TABLE  B8.  REQUIRED  REENLISTMENT  RATE  (LARGE  CELL) 


TABLE  B9.  REQUIRED  REENLISTMENT  RATE  (SMALL  CELL) 


TABLE  BIO.  TOTAL  COSTS 
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Appendix  C 

DERIVATION  FOR  NUMBER  OF  REQUIRED  RECRUITS 


In  this  Appendix  we  derive  the  value  a  required  such  that 

P(R  =  c/Y  <  r  |A  =  a)  >  b,  where  A  =  a  is  the  number  of  people  that 

should  be  recruited  to  assure  a  probability  of  at  least  b  that  a 
reenlistment  rate  R  no  higher  than  r  will  be  required  to  reenlist 
c  of  those  people  for  the  career  force,  and  where  p  is  the  probability 
that  an  individual  recruit  makes  it  through  his  initial  four-year 
obligation. 

If  R  is  the  (random)  required  reenlistment  rate  and  Y  is  the 

mmber  of  people  remaining  after  four  years  of  service  of  those  a  who 

are  recruited  initially,  then  R  =  c/Y,  and  Y  is  binomial  with  parameters 

a  and  p.  Thus  Y  has  mean  pa  and  variance  p(l  -  p)a.  Then 

Z  =  (Y  -  pal/Z^Of-pTa  is  approximately  a  standard  normal  (mean  0, 

variance  1)  random  variable,  and  we  let  be  the  upper  quantile 

point  such  that  P(Z  -•  z,  )  =  b. 

-  b 

Since 

P(R  <  r  |  A  =  a)  =  P(c/Y  -  r|A  =  a) 

=  P(c/r  <  YlA  =  a) 
c 

-  -  ap 

=  P(Z  >  - - (a  =  a)  , 

4>(l-p)a 
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we  obtain 


-  ap 


/p  (l-p)a 


Solving  for  a,  we  obtain  a  quadratic  equation  in  a,  and  its 


2pc/r  +  z  2  p(l-p)  +  /[2pc/v  +  z  2  p(l-p)]2  -  4p2c 


a  = 


2P 
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